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cgagagacgacagaaggttacggctgcgagaagacgacagaagggtccagaaaaa 

ggaaagtgctggaggggagtggggacaaaagcagcga<x:aagtgaatgtcacttc 

agtgactgaggccaggcaaaacgcgcgggaaggattttgtgtagcttgggaccctt 

tcatagacactgatgacacgtttacgcaaaatagaaatttgaggagaaacgcxn'gg 

gccttcggaaaggagtgattgattagtactrgcaagtttaggtgacrttaaggagaa 

ctaactaatgtatactattgagggaggaggaagagcattacagagtttccagcagc 

agcaggaaagctttggttaatttggaaatggatgatagcattaaaataacagaagc 

gccrccaggtctctgaagcttcagtcccccagctgaaagccagaaaagactaagcc 

cactaagcctt1tgatccctttggaagc 

tctcctcagaagatttcctgtctctgkdctatgttacaagaggaatcaaaaccaagac 

agaagagctcaggatgcaggtgagaggcagggaagtcagcggcttgttgatctccc 

tcactgctgtctgcctggtggtcacccctgggagcagggcctgtcctcgccgctgtg 

cctgctatgtgcccacagagkstgtcactgtacatttcggtacctgacctccatcccag 

atggcatcccggccaatgtggaacgaataaatttaggatataacagccttactagat 

tgacagaaaacgactttgatggcctgagcaaactggagttactcatgctgcacagt 

aatggcattcacagagtcagtgacaagaccttctcgggcttgcagtccttgcaggtc 

ttaaaaatgagctataacaaagtccaaatcattcggaaggatactttctacggactc 

gggagcttggtccggttgcacctggatcacaacaacattgaattcatcaaccctgag 

gcctltratggacttacctcgctccgcttggtacatttagaaggaaaccggctcaca 

aagcrccatccagacacatttgtctcattaagctatctccagatatttaaaacctctt 

tcattaagtacctgttcttgtctgataacttcctgacctccctcccaaaagaaatggt 

ctcctacatgccaaacctagaaagcctgtatttgcatggaaacxrcatggacctgtg 

ctgccatttaaagtggttgtctgagtggatgcagggaaacccagatataataaaat 

gcaagaaagacagaagctcttccagtcctcagcaatgtcccctttgcatgaacccca 

ggatctctaaaggcagaccctttgctatggtaccatctggagctttcctatgtacaa 

agccaaccattgatccatcactgaagtcaaagagcctggttactcaggaggacaat 

ggatctgcctccacctcacctcaagatttcatagaacc(nttggcrccttgtctttga 

acatgacananntntctggaaataaggccgacatggtctgtagtatccaaaagcca 

tcaaggacatcaccaactgcattcactgaagaaaatgactacatcatgctaaatgc 

gtcattttccacaaatcttgtgtgcagtgtagattataatcacatccagccagtgtg 

gcaacttctggctttatacagtgactctcctctgatactagaaaggaagccccagct 

taccgagactccitcactgtcitctagatataaacaggtggctctraggcctgaaga 

catititaccagcatagaggctgatgtcagagcagaccctritrggltccaacaaga 

AAAAATTGTCTTGCAGCTGAACAGAACTGCCACCACACTTAGCACATTACAGATCCA 

GTTTTCCACTGATGCTCAAATCGCTTTACCAAGGGCGGAGATGAGAGCGGAGAGAC 

TCAAATGGACCATGATCCTGATGATGAACAATCCCAAACTGGAACGCACTGTCCTGG 

TTGGCGGCACTATTGCCCTGAGCTGTCCAGGCAAAGGCGACCCTTCACCTCACTTGG 

AATGGCTTCTAGCTGATGGGAGTAAAGTGAGAGCCCCTTACGTTAGCGAGGATGGG 



7-.* * r ' iv 

CGAATCCTAATAGACAAAAATGGGAAGTTGGAACTGCAGATGGCTGACAGfS^GX 

TGCAGGTCTTrACCACTGCATAAGCACCAATGATGCAGATGCGGATGTTCTCACATA 

CAGGATAACTGTGGTAGAGCCCTATGGAGAAAGCACACATGACAGTGGAGTCCAGC 

ACACAGTGGTTACGGGTGAGACGCTCGACCTrCCATGCCTTTCCACGGGTGTTCCAG 

ATGCTTCTATTAGCTGGATTCnTCCAGGGAACACTGTGTTCTCTCAGCCATCAAGAG 

ACAGGCAAATTC1TAACAATGGGACCTTAAGAATATTACAGGTTACGCCAAAAGAT 

CAAGGTCATTACCAATGTGTGGCTGCCAACCCATCAGGGGCCGACITITCCAGTTTT 

AAAGTTTCAGTTCAAAAGAAAGGCCAAAGGATGGTTGAGCATGACAGGGAGGCAG 

GTGGATCTGGACTTGGAGAACCCAACTCCAGTGTTTCCCTTAAGCAGCCAGCATCTT 

TGAAACTCTCTGCATCAGCTTTGACAGGGTCAGAGGCTGGAAAACAAGTCTCCGGTG 

TACATAGGAAGAACAAACATAGAGAOTAATACATCGGCGGCGTGGGGATTCCACG 

CTCCGGCGATTCAGGGAGCATAGGAGGCAGCTCCCTCTCTCTGCTCGGAGAATTGAC 

CCGCAACGCTGGGCAGCACTTCTAGAAAAAGCCAAAAAGAATTCTGTGCCAAAAAA 

GCAAGAAAATACCACAGTAAAGCCAGTGCCACTGGCTGTTCCCCTCGTGGAACTCA 

CTGACGAGGAAAAGGATGCCTCTGGCATGATTCCTCCAGATGAAGAATTCATGGTTC 

TGAAAACTAAGGCTTCTGGTGTCCCAGGAAGGTCACCAACTGCTGACTCTGGACCAG 

TAAATCATGGTTTTATGACGAGTATAGCTrCTGGCACAGAAGTCTCAACTGTGAATC 

CACAAACACTACAATCTGAGCACCTTCCrGATTTCAAATTATTTAGTGTAACAAACG 

GTACAGCTGTGACAAAGAGTATGAACCCATCCATAGCAAGCAAAATAGAAGATACA 

ACCAACCAAAACCCAATCATTATCTITCCATCAGTAGCTGAAATTCGAGATTCTGCT 

CAGGCAGGAAGAGCATCTTCCCAAAGTGCACACCCTGTAACAGGGGGAAACATGGC 

TACCTATGGCCATACCAACACATATAGTAGCTTTACCAGCAAAGCCAGTACAGTCTT 

GCAGCCAATAAATCCAACAGAAAGTTATGGACCTCAGATACCTATTACAGGAGTCA 

GCAGACCTAGCAGTAGTGACATCTCTTCTCACACTACTGCAGACCCTAGCTTCTCCA 

GTCACCCTTCAGGTTCACACACCACTGCCTCGTCTITATITCACATTCCTAGAAA 

CAATACAGGTAACTrCCCCTTGTCCAGGCACTTGGGAAGAGAGAGGACAATTTGGA 

GCAGAGGGAGAGTTAAAAACCCACATAGAACCCCAGTTCTCCGACGGCATAGACAC 

AGGACTGTGAGGCCAGCAATCAAGGGACCTGCTAACAAAAATGTGAGCCAAGTTCC 

AGCCACAGAGTACCCTGGGATGTGCCACACATGTCCTTCCGCAGAGGGGCTCACAG 

TGGCTACTGCAGCACTGTCAGTTCCAAGTTCATCCCACAGTGCCCTCCCCAAAACTA 

ATAATGTTGKKjGTCATAGCAGAAGAGTCTACCACTGTGGTCAAGAAACCACTGTTAC 

TATTTAAGGACAAACAAAATGTAGATATTGAGATAATAACAACCACTACAAAATAT 

TCCGGAGGGGAAAGTAACCACGTGATTCCTACGGAAGCAAGCATGACTTCTGCTCC 

AACATCIXjTATCCCTGGGGAAATCTCCTGTAGACAATAGTGGTCA(XTGAGCATGCC 

TGGGACCATCCAAACTGGGAAAGATTCAGTGGAAACAACACCACTTCCCAGCCCCC 

TCAGCACACCCTCAATACCAACAAGCACAAAATTCTCAAAGAGGAAAACTCCCTTG 

CACCAGATCTTTGTAAATAACCAGAAGAAGGAGGGGATGTTAAAGAATCCATATCA 

ATTCGGTTTACAAAAGAACCCAGCCGCAAAGCTTCCCAAAATAGCTCCTCTTTTACC 

CACAGGTCAGAGTTCCCCCTCAGATTCTACAACTCTCTTGACAAGTCCGCCACCAGC 

TCTGTCTACAACAATGGCTGCCACTCAGAACAAGGGCACTGAAGTAGTATCAGGTG 



3/90 



ccagaagtctctcagcagggaagaagcagcccitcaccaactcct 

ctagcaccataagcaagagatctaatacattaaacttcttgtcaacggaaaccccca 

cagtgacaagtcctactgctacixkatctgtcattatgtctgaaacccaacgaacaa 

gatccaaagaagcaaaagaccaaataaaggggcctcggaagaacagaaacaacgc 

aaacaccacccccaggcaggtttctggctatagtgcatactcagctctaacaacagc 

tgatacccccttggctttcagtcattccccacgacaagatgatggtggaaatgtaag 

tgcagttgcitatcactcaacaaccrctcttctggccataactgaactgtttgagaa 

gtacacccagactttgggaaatacaacagctttggaaacaacgttgttgagcaaat 

cacaggagagtaccacagtgaaaagagcctcagacacaccaccaccactcctcagc 

agtggggcgcccccagtgcccactccttccccacctccttttactaagggtgtgg^ 

acagacagcaaagtcacatcagctttccagatgacgtcaaatagagtggtcaccat 

atatgaatcttcaaggcacaatacagatctgcagcaaccctcagcagaggctagcc 

ccaatcctgagatcataactgk5aaccactgactctccctctaatcrgttrccatccac 

ttctgtgccagcactaagggtagataaaccacagaattctaaatggaagccctctcc 

ctggccagaacacaaatatcagctcaagtcatactccgaaaccattgagaagggca 

aaaggccagcagtaagcatgtccccccacctcagccttccagaggccagcactcat 

gcctcacactggaatacacagaagcatgcagaaaagagtgtttttgataagaaacc 

tggtcaaaacccaacttccaaacatctgccttacgtctctctacctaagactctattg 

aaaaagccaagaataattggaggaaaggctgcaagcittacagttccagctaattc 

agacgtttttctrcctrgtgaggctgttggagacccactgcccatcatccactggacc 

agagtttcatcagganttgaaatatcccaagggacacagaaaagccggttccacgt 

GCTTCCCAATGGCACCTTGTGCATCCAGAGGGTCAGTATTCAGGACCGTGGACAGTA 

CCTGTGCTCTGCATTTAATCCACTGGGCGTAGACCATTTTCATGTCTXJrTTGTCT 

GTTTTTTACCCGGCAAGGATTTTGGACAGACATGTCAAGGAGATCACAGTTCACTTT 

GGAAGTACTGTGGAACTAAAGTGCAGAGTGGAGGGTATGCCGAGGCCTACGGTTTC 

CTGGATACTTGCAAACCAAACGGTGGTCTCAGAAACGGCCAAGGGAAGCAGAAAGG 

TCTGGGTAACACCTGATGGAACATTGATCATCTATAATCTGAGTCTTTATGATCGTG 

GTTTTTACAAGTGTGTGGCCAGCAACCCATCTGGCCAGGATTCACTGTTGGTTAAGA 

TACAAGTCATCACAGCTCCCCCTGTCATTATAGAGCAAAAGAGGCAAGCCATCGTTG 

GGGTTTTAGGTGGAAGTTTGAAACTGCCCTGCACTGCAAAAGGAACTCCCCAGCCTA 

GTGTTCACTGGGTCCTTTATGATGGGACTGAACTAAAACCATTGCAGTTGACTCATT 

CCAGATTTTTCTTGTATCCAAATGGAACTCTGTATATAAGAAGCATCGCTCCTTCAGT 

GAGGGGCACTTATGAGTGCATTGCCACCAGCTCCTCAGGCTCAGAGAGAAGGGTAG 

TGATTCITACTGTGGAAGAGGGAGAGACAATCCCCAGGATAGAAACTGCCTCTCAG 

AAATGGACTGAGGTGAATTTGGGTGAGAAATTACTACTGAACTGCTCAGCTACTGG 

GGATCCAAAGCCTAGAATAATCTGGAGGCTGCCATCCAAGGCTGTCATCGACCAGT 

GGCACAGAATGGGCAGCCGAATCCACGTCTACCCAAATGGATCCTTGGTGGTTGGG 

TCAGTGACGGAAAAAGACGCTGGTGACTACTTATGTGTGGCAAGAAACAAAATGGG 

AGATGACCTAGTCCTGATGCATGTCCGCCTGAGATTGACACCTGCCAAAATTGAACA 

GAAGCAGTATT1TAAGAAGCAAGTGCTCCATGGGAAAGATTTCCAAGTTGACTGCA 
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AGGCCTCTGGCTCCCCTGTGCCTGAGGTATCCTGGAGTTTGCCrGATGGGACAGTGC 

TCAACAATGTAGCCCAAGCTGATGACAGTGGCTATAGGACCAAGAGGTACACCCTT 

TTCCACAATGGAACCTTGTATTTCAACAACGTTGGGATGGCAGAGGAAGGAGATTAT 

ATCTGCTCTGCCCAGAACACCTTAGGGAAAGATGAGATGAAAGTCCACCTAACAGT 

TCTAACAGCCATCCCACGGATAAGGCAAAGCTACAAGACCACCATGAGGCTCAGGG 

CTGGAGAAACAGCTGTCCTTGACTGCGAGGTCACTGGGGAACCGAAGCCCAATGTA 

TTTTGGTTGCTGCC1TCCAACAATGTCATTTCATTCTCCAATGACAGGTTCACATTTC 

ATGCCAATAGAACTITGTCCATCCATAAAGTGAAACCACTrGACTCrGGGGACTATG 

TGTGCGTAGCTCAGAATCCTAGTGGGGATGACACTAAGACATACAAACTGGACATT 

GTCTCTAAACCTCCATTAATCAATGGCCTGTATGCAAACAAGACTGTTATTAAAGCC 

ACAGCCATTCGGCACTCCAAAAAATACTTTGACTGCAGAGCAGATGGGATCCCATCT 

TCCCAGGTCACGTGGATTATGCCAGGCAATATTTTCCTCCCAGCTCCATACTT^ 

AGCAGAGTCACGGTCCATCCAAATGGAACCTTGGAGATGAGGAACATCCGGCTTTC 

TGACTCTGCGGACTTCACCTGTGTGGTTCGGAGCGAGGGAGGAGAGAGTGTGTTGGT 

AGTGCAGTTAGAAGTCCTAGAAATGCTGAGAAGACCAACATTCAGAAACCCATTCA 

ACGAAAAAGTCATCGCCCAAGCTGGCAAGCCCGTAGCACTGAACIXjCTCTGTGGAT 

GGGAACCCCCCACCTGAAATTACCTGGATCTTACCTGACGGCACACAGTTTGCTAAC 

AGACCACACAATTCCCCGTATCTGATGGCAGGCAATGGCTCTCTCATCCTTTACAAA 

GCAACTCGGAACAAGTCAGGGAAGTATCGCTGTGCAGCCAGGAATAAGGTTGGCTA 

CATCGAGAAACTCATCCTGTTAGAGATTGGGCAGAAGCCAGTCATTCTGACATACGA 

ACCAGGGATGGTGAAGAGCGTCAGTGGGGAACCGTTATCACTGCATTGTGTGTCTG 

ATGGGATCCCCAAGCCAAATGTCAAGTGGACTACACCGGGTGGCCATGTAATCGAC 

AGGCCTCAAGTGGATGGAAAATACATACTGCATGAAAATGGCACGCTGGTCATCAA 

AGCAACAACAGCTCACGACCAAGGAAATTATATCTGTAGGGCTCAAAACAGTGTTG 

GCCAGGCAGTTATTAGCGTGTCAGTGATGGTTGTGGCCTACCCTCCCCGAATCATAA 

actacctacccaggaacatgctcaggaggacaggggaagccatgcagctccactgt 

gtggccttoggaatccccaagccaaaagtcacctgggagacck:caagacactccct 

gctctcaaaagcaacagcaagaaaaccccatagaagtgagatgcttcacccacaag 

gtacgctggtcattcagaatctccaaacctcggatrccggagtctataagtgcagag 

ctcagaacctacttgggactgattacgcaacaacttacatccaggtactctgacagg 

aagggggagactaaaattcaacagaagtccacatccacagggtttattttttggaa 

gaagtttaatcaaaggcagccataggcatgtaaatgagtctgaatacatttacagt 

attaaatttacaatggacatgcgatgagacttgtaaatgaaagcattgtgaactga 

aaccgagtctctgtggatctcaaagcaaactcttaacttaaggcactttgattttgc 

caacaaataataacaaacattaagagaaaaaaatgatccactacgaaataacaaac 

ggctaatgcacctgaattctcagtaaaaagacctttctctcgctaacagttgccagc 

tgcctcgtgtctgtttcctaccaatgtcacaaacatcgcacacagggtgaatggagt 

caacgggaaagattaagtttgcggtctgtgtaaatctcaatgtacaaatattctgtc 

nctggtttataaacattttgataaaaccgaaaaaaaaaaaaaaaaaaaaaaaaaa 

A AAA (SEQ ED NO: 1) 
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MQWGREVSGmSLTAVCLVVTPGSRACPRRCACYVPTEVHCTFRYLTSIPDGIPANVE 

RJ2sfLGYNSLTRLTENDFIXjLSKl.ELLMLHSNGmVSDKTFSGLQSLQVLKMSYNKVQn 

RTOTFYGLGSLVRLHLDHNNffiFW^ 

FKTSFKYLFI^DNFLTSLPKEMVSYMPNIJ^ 

IKCKKDRSSSSPQQCPI£MNPRISKGRPFAMW^ 

ASTSPQDFffiPFGSL^LNMTXXSGNKADMVCSIQKPSRTSPTAFTEENDYIMLNASFSTNL 
VCSVDYNfflQPVWQLLALYSDSPLD-ERKPQLTETPSLSSRYKQVALRPEDlFTSIEADVR 
ADPFWQQEKIVLQL^TATTLSTLQ^^ 
LFJITVLVGGTIAI^CPGKGDPSPHI^WLIA^ 

DSFDAGLYHCISTNDADADVLTYPJTTVVEPYGESTHDSGVQHTVWGETIJ3IPCLSTGV 
PDASBWILPGNTVF^QPSRDRQIL^GTmiLXJVTPKDQGHYQCVAAJ^SGADFSSFKV 
SVQKKGQRM\^EHDRFACK3SGLGEPNSSVSLKQPASLKLSASALTGSEAGKQVSGVHRK 
NKHRDLIHRRRGDSTl^RFREHRRQLPI^ 

KPVPLAVPLVELTDEEKDASGMIPPDEEFMVLKTKASGVPGRSPTADSGPVNHGFMTSI 
ASGTEVSTVNPQTLQSEHLPDFKLFSVTNGTAWKSMNPSMSKffiDTTNQNPIIIFPSVAE 
IRDSAQAGPvASSQSAHPWGG^MATYGHWTYSSFTSKASTVLQPINPTESYGPQIPff 
SRPSSSDISSHTTADPSFSSHPSGSMTASSII^RhMm}^ 

KNPmTPVLRRHRmTVRPAIKGPAMONrVSQWATEYPGMCHTCPSAEGLTVATAALS 
VPSSSHSALPKTNNVGVIAEESTTWKJCPLL^ 

ASMTSAPTSVSLGKSPVDNSGHl^MPGTIQTGKDSVETTPLPSPLSTPSIPTSTKFSKRKTP 

LHQIFVNNQKKEGMLKNPYQFGLQKNPAAKLPK1APLLPTGQSSPSDSTTLLTSP 

TMAATQNKGTEVVSGARSLSAGKKQPFTNSSP^ 

ASVMSFJXJRTRSKEAKDQIKGPRKNRNNANTO 

RQDDGGNVSAVAYHSTTSLLAITELFEKYTQT^ 

PPLLSSGAPPVPTPSPPPFTKGVVTDSKVTSAFQMTSmVWIYESSPJINTDLQQPSAEAS 
PNPEirTGTTDSPSNLFPSTSWAl^VDKPQNSKWKPSPWPEHKYQLKSYSETIEKGKRPA 
VSMSPHI^LPEASTHASHWNTQKHAEKSVFDKKPGQN^ 

GKAASFTVPANSDVELPCEAVGDPLPIIHWTRVSSGXEISQGTQKSRFirTLPNGTLSIQRV 

SIQDRGQYLCSAFNPLGVDIlHWSLSVVFYPAPJLDRIiVKEITVHFGSTVELKCRVEGMP 

RPTVSW1LANQTVVSETAKGSRKWVTPDGTLIIYNLSLYDRGFYKCVASNPSGQDSLL 

VHQVITAPPVIffiQKKQAIVGVLGGSLKLPCTAKGTPQPSVHWVLYDGTELKPLQLTHS 

RFFLYPNGTLYIRSlAPSVRGTYECIATSSSGSERRVVILTVEEGETIPPJF/rASQKWTEVN 

IXrEKLIiNCSATGDPEIRIIWRLPSKAVroQWIlRMGSRIHVYPNGSLVVGSWEKDAGD 

YLCVARNKMGDDLVLMHVRLRLTPAKIEQKQYFKKQVLHGKDFQVDCKASGSPVPEV 

SWSIJ'DGTVLNNVAQADDSGYRTKRYTLFHNGTLYFNNVGMAEEGDYICSAQOT 

DEMKVHLTVLTAIPR1RQSYKTTMRLRAGETAVLDCEVTGEPKPNVFWLLPSNNVISFS 

bTOPJFTFHANRTI^IHKVKPLDSGDYVCVAQWSGDDTKTYKLDIVSKPPLINGLYANKT 

VIKATA1RHSKKYFDCRADGIPSSQVTWIMPGN1FLPAPYFGSRWVHPNGTLEMRNIRLS 



DSADCTCVVRSEGGKVLVVQLEVLEMLRRPTFRNPFNEKVIAQAGKPVALNCSVDGNP 
PPEITWIIPDGTQFANRPHNSPYIJ^GNGSLILYKATEINKSGKYRCAARI^ 

LEIGQKPVILTYEPGMVKSVSGEPI^LHCVSDGIPKPNVKWTTPGGHVIDRPQVDGKYIL 

HENGTLVIKATTAHDQGNYICRAQNSVGQAWSVSVMWAYPPRIINYIPRNMLRRTGE 

AMQLHCVAD3IPKPKWWETPRHSLLSKATARKPmSEMLHPQGTLVIONLOTSDSGVY 
KCRAQNLLGTDYATTYIQVL (SEQ ED NO: 2) 
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ATG(irac573) 



Kat-608 5TTTR 



Rat-608 OKF (7791bp) [ 



^ Homology 



<23 300 




Bac23-26*1L4 

(iWCl-S'cxkl) 
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Noel 

1nrrcxttAcc^ax&<U7cx^^ 

TCCAATGGAATACTACTX^CrTATTAAAA^^ 

GNATCXnClxm<3aATCnX3A<m^ 
ACTCACttUTAAGlX^ 
TTITraTCnOTArAATATrAAQAIXKJAAAT^ 
TOTnCAQTOGAmCOQGMCSAAMTMTAAOl^ 

A<3Km3ACAAAOTAlCa^(^CA^ 
<mTT<>TATrAACTCCAGTTraXATCT^^ 
AACrrCA<XAOCTTtXnxCACA qAG<^OTTACTATOCrr^ 
C^rfAA<KTOCACm!UArATQAGTt?rflAC mm- 

TAATmAT<OTAQAAATOT^Ar ^ ^ 
ATn- M<XXXXX^ACTA(XK^TAArriTAATC^ A 

y^ A ^5^ G rcrc«n*^ 

AMTAGOA<UAACKUOCTTAfl AATA<^^ 

^^GQTVA^TAAQAOGTOOGAA A<W^TAnTO^ArATCTAQAAnTA A A AT^ Arr?^ 

A ffiAQCOATPCTXXMAAATAOT^^^ 

<XUGCTOA<XflTCT(^AflGCr ^ 

OAQCHTAGICTXX?AAGGAG(nT1tlG^ 

(^TATOL\ATATrACTAArrTO rACOCAAG^ 

£TOAimA<HAAGCATATAAA QTXUOAGCA<re^^ 

1^^^^^^^^^ ^^ATTTTraTAATOCATC^raAA 
AAAfiC r nAOAAATriTXOT^ 

^O^TGTATCA<X^TAAfiATty! AAOCTrA&AAA<ffl 
AACAACAACGACAArAAA^i^ 

g AAA ^ff A ;™^ ^ 



ATA<lAOAOAATAAAaKUTAflAT AC<nTre^ 

im-ACACCTCAAAGATAOATAACCAACAAAOAGOAAQAGOATAQGTOOC^^ 
<XUA<&grACATTCXiAAA<ntAG^^ 

4Noc2817 

AAAAAAAAAAAAAAAAAAAA^^ 
<3K^WkCAGAAG<K30AGTC^CAAACCCAGOAC^ 
TCHTATTGCIXSTTOrrCT 
CTATGACATGTTTTTTTATTCAAGTATATX^ 
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CKXX^OCCCCCTITO^ 

CACACACACACAC^CACACACACACX^^ 

CACCX^GCTAATOCTTH^^ 

CCCCX^XXXXXl<nTAGGAqATCKK3A 

GCCAQ0C0GG(KX^CAGAGCC1XKX^CCA 

<XUflGAGCAAACTCGCAOCKX3^^ 

flAQCQOCTGCTMTCC^^ 

(XTTGGGGTTC^GGGCGAQACAGAGGGTGGGAT^^ 

ACTG<^<2GCKU£rriTK^ACA^^ 

GG€AAAAG<^<XX^AAGGATTATAX^^ 

TlGlXX*AAGC\A<X*ATr^^ 

A^AClTCTAAGTTTAfKJTGAGTTI^CAACTAACr 

A<^TKX^<XMOCAGC&GC^^ 

ATGTATGATAOCATTAAAATAACAGA 

<mUA<UCTAAGCClACTAAGCC^^ 

TGAAG<X7FCIXXTCAGAAGATrr<^ 

AGAAGA£CTCACOTATTG<XAACTCTrT 



* *fac4203 

TOTTAAATACAOTTTGAGGCITAAGTGTACGi^ 



CATCrrTGTrTATCrrAATAATA 

CAAAAATATTCAGTOAttHAACAG^ 

TTAATTTOGGTTAAACTGATTCATAAC^ 

CACGTGOG^TTAAA^TATITTCT 

CCATCACTGTGfrCAGTAAGCAGAQQGGT^^ 

TGCCTOCAAMTCAGAGAC^ 

AGTTOOCKTOAAAAGGCCK^^ 

OAAATTTGAOCTTmTTCrAGTGCKITTATrAC^ 

CCAAATAAATAAAACACATATTAAATAATA^^ 

GTACATTCATAAAKKl^TGTCAAAAA 

TTTTCTAAATTGGACATGTACTATITrrATATT^ 

AGTQTAAAACKK^TCTTGGTCAAACAAGAGAGTTAAAT^ 

AACATTATTATQATCrrCTAATQACAGGGATCCTGCTT^ 

AAQATATCrrrACAATAATAAQCCCATTTAGTGATAAACm 

AClXXflTTATAAA<MGAACAGTI^^ 

AQTTOCTCTTTCTOCAGATra 

CATTATAAQTIXKiAATAAGATAGGATCTCA^ 

TAATXTTTrrGK^OriCIYATQGAATr^^ 

ACCTCroAAACTAGAACTAOCAATOCATrAClXKKnATGTA 

OAATTTATOiCTACATTffrTCAT ^ 

(KXXnX^AAAATATGAAOGGATAAAGAAAATGTGGAAATGTCT 

<X%ATAAAAQAQT<3AAA1X&TOACATAT^ 

<KXrTAATCAAAACAAGAGAG^^ 

GAA<CTAOATTTA<^CHX^^ 

ATTCTAATCTGTGAGTGTQTATAAACK^t^ 

TCKK3TAGGAGAGAAAGGAATQAGAGGAQTTAATAAGAAC 

CCA<X3ATGAAATGCATrAArrrCTATXynAACTAAA^ 

AACCACXXJIXjAAATCCCAOCAC^^ 

CAAOAAQ<XATrrGCAGTTGCTG<XTGCTO^ 

ACKnXnATAACAAC£AGTTOACAATACAAAGTIX3 

Tm^mTnTlCTTACTOTTT^^ 

ATTTGTTTCTTITGAGAGAGAGAAGGAACATG^ 

ATCTXK3AAgA*GTKXKKM^ 

AACAAACAAAAGGTTXXTTTrGCCACAAAAAGQTg 

CTTAAAGAAMTATTCCCAATTATT^^ 
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CTTOAACTQCKXCTATAACTCI^ 
TCCATCCCAGACXX5<^TCXX^^ 

ooAccrnKxrrGATCKx^nx^^ 

AAACACCTOlKMTACTmAA^ 

TGOAC^CAATQAACAmTATXXX^TAGTCTm 

GtTIX3GGAG<XTCATAATrAlXKKXKA^ 

ACAGTTTCLGTTAACAATCATrATG^ 

AAAAGCaTTTXHTITGCAATTTATaAATAC^^ 

CClTACrAAATCKiACAAAm 

ATAAATCK^CAAQAAATOOOATATG(X!ATC^ 

ATGCCAAACACTATlGfln^ 

TATAACCOTATAOX^GT^^ 

ClTXCTATOAOOaAGGGGCCTO^^ 

GKKAGGimATGATAACTTOA^ 

GAC^CATGCCTOAAGAA<3^T^ 

TGATTCATOGGG(^<K3GCCe^^ 

AAAAAAAAAC^GGCTGAGCAAGTGTCAAGGAGCAAGT^^ 

TCinClXK^TC^OCItXrix^ 

C^GTOATCAACAATOATGKKiAA 

TTXKjTCATGATGTT^ 

CCnTQGACTAAATIX^TGTTATCTATAGCn^ 

CTACACCAGAGTGTAATCTGGGCC^ 

GGGGCTTCnAACCGGAACACAGT^ 

TGTOGACTITAATIGACGAlTCATClXj^ 

CTCACTAGATTGACAGAAiATOAC^^ 

GC4ATGG<^TTC^CAGAGTCAGTGACAA<^CCT 

NacS21$ A 

GTGAGATAGGTAGAGGGTeATGGAGGCmAGAAGAGAGGT^^ 
AAAACTCTOCIGAnoCCGTCGGA 

k^ggaag<3atttcaattgtaatc<^ 

ctggggaaaatgcttacacaatrgcgaggacctaaot 

aaagcacagoatggcggca<ucaccigga^ 

c(x^gagactcattgg<xa^ 

acttgactcagaaactaatgtg<^aagcatc 

<^tgcatoaataagggatxxxagagagaa<kk3aagaaaaaggaag 

aa<3gaaggaaggmggaaggaaggaaggaaggaagag^ 

aaggaaggaaagggaaaggaaaaaagagatggggagggagggaaggaaag 

ggagaaagaa<ugaagaaaggaaaataaataaatttit^g 

ttttatcxxraaaaggtcatttxx^ 

aagggcagkotaac&tagtagca^ 

TTTITATCCrcrACCATCTrCAAACTGAAACTAC^ 

TGTMGAGTTnX^CACACraAAGTGGGAAACTrAAG^ 

AAT<KXXUGAACAAAAACAGACTG<^C&(^^ 

TIGAGAAGCTAAITGACXKnXK^TGACTACTCA^ 

ATGAAAAGC^CMTAAAGTACATATGltXX^TAA 

ATAATAGCAAICAAAGCAAXA^ 

ACTATGTCTA<nGTTTTGACTTAGGOT^ 

ATITA<>GTCATCAGACAG<^TITGGAACATC 

GC^TGAAATTOFCXACCTA^ 

GGGTTTGAGGTTTTACra^ 

CITmTTTATtUGGGATOT^ 

GGtCAAAACAGAAAAATATGTAAAAAGTTATTGTITTTAT^ 
TTTTAAAAATOGTATCKOTAQAACTAA^ 
AGAGAGGGATITGATQAATGCTAAAGCATCATGAAAAAT^ 
C^GAATCAATTAAATTCATAT^ 

GAGTGGATAAACIGTOTAACGTGAGTGCTATO^ ~ 
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<nAAGATOTTACnAG<jK3TC^ 

ATOTAATITGOOQAAAAGACAATITOT 

ATCCCmTOGATATIX3TA\TAAACCA 

GCAGGAGGAA01X3AAATGAGATGGTAQAAACK3AAAG 

GGTOOAATCTAGATATCnTAATATAT^^ 

AAAACCAACAGGAGm^GGGAGACAATAGGAACX^ 

TAAAAACAAAAGATTITIXK^GAAQAATTATITAT^ 

GAACTITGAAGTTCATTXjATAGlXTC 

TTrATrATTAAJtACTCTT^^ 

ATCKnTTTX^CCATAC^CAAATGTAOG<^ 

GGAGA<X5GAAAGGAAGCGG^ 

ATQAACAAGAAAAAATQTGOACACATGCACAAAAATT^ 

TATTCTAATATTAAGAAAATAATAAACCCATTI^ 

TTATCITGGCCACAGAlXXlXnX^ 

AATO^TCCAGGGGATGATGOTACTC^ 

AAGTAAC£ATGACATCTCTCIX5^^ 

GTATGIICTAATIICTCTAAT^^ 

T(3<jAGTTAAGG1xXCATAOGA^ 

TTXJIX3AAAGTTCCTTrcAaiAAACA 

a<x^ataacaaacixxaaataattcuoaa<^ 

TOX&O^rWATCACAAC^^ 

tCOtXTTOGTACATC^ 

<^GKnATCICOW^ATrrAA^ 

AccrcocxtJtt&A^ 

ACCX^TGGA<XTGTGACTG€CATTTAAAGTG<;TTG^ 

Nae 11*80 f 

ACTAiCTiUiTitfmGiriurnu 

TATCXX^VAAAGTCCCOCATA^ 
CCCIGGCAITCCCCTOTACIX3GGGC^ 
AGTGATGGCCAACTAGGCCATCTTTTGATAC^ 
TACTGGTTAGTTCATAATGTIGTTGC^ 

N«clW7^ 

(SEQIDNO:3) 
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Figure 8 
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Intron 


No. 


start 
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length 
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1 


1 


542 
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2 


2898 


3094 
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1334 


3 


4428 


4557 


130 
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4 


7625 


8019 


394 





Figure 9 
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Mr, 



P5 I 





1 




1 


cDHXjcat 


51 


g^oaicjra 


1 




101 




1 


cBKArat 


151 




1 




201 




1 








1 


cD&Ajrat 


301 




x 


cSK\_rat 


351 




1 




401 




1 


cENAjrafc 


451 




1 




501 




1 




551 


genomic bu 






601 








€51 




1 


cBNArat 


701 




1 


cDN&rat 


751 


ganoaiejiu 


1 




801 


gonottic hu 


1 




SSI 


gr*noaioJbti 


1 




$01 




1 



ogagagacgacagaaggttacggcfcgcgagaagacgacagaagggtccag 
aaaaaggaaagtg<^ggaggggagtggggacaaaagoagcgaccaagtga 
atgtcacttcagtgactgeggccaggcaaaaogogcgggaaggattttgt 
gtagcttgggaccctttcatagacactgatgacacgtttaogoaaaatag 
aaatttgaggagaaacigccfcg^gccttcggaaaggagtgattgattagta 

cttgcaagtttaggtgactttaaggagaactaactaatgtatactattga 

gggaggaggaagagcattacagagfcttccagcagcagcaggaaagctttg 
gtfcaatttggaaafcggatgatagcattaaaataacagaagcgcetccagg 
totctgaagottcagtcccccagctgaaagocagaaaagactaagcccac 
taagccttttgatecctttggaagcaaagaactttccttcectggggtga 
^gactctoctoagaagatttcctgtctctgcctatgttacaagaggaatc 

aaaaccaagacagaagagctcaggatgcaggtgagaggcagggaagtcag 
oggcttgttgat ot occtoactgct gt ctgcctggtggtcaoccctggga 
gcagggcotgtoctogccgctgtgcctgctatgtgcccacagaggtgcac 
tgtacatttcggtacctgacctccatcccagatggoatcccggccaatgt 
ggaacgaataaatttaggatataacagcottactagattgacagaaaacg 
actttgatggcotgagcaaactggagttactcatgctgoaoagtaatggc 
attcacagagtcagtgacaagaccttctcgggcttgcagtccttgcaggt 
cttaaaaatgagctataacaaagtccaaatcattcggaaggatactttct 
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$51 
1 


acggactcgggagcttggfcccggttgcacctggatcacaacaacattgaa 


tSDKXjcat 


1001 

1 


ttcatcaaccctga^gccttttatggacttacctcgctccgcttggtaca 


geaoaicjtax 


1051 
1 


tttagaaggaaaccjyctcacaaagctccatccagacacatttgtotcat 


«DK&_ac&fc 


1101 

1 


taagctatctccagatatttaaaacctctttcattaagtacctgttcttg 




1151 
1 


tctgataaetfccctgacctccctccoaaaagaaatggtotcctaoatgco 




1201 
X 


aaacctagaaagcctgtatttgcatg$aaacccatv$acct$tgactjcc 




1251 
1 


afcttaaagtggiitgtotgagtggatgcagggaaaccoagat&taataaaa 


ffeooaicjbu 


1301 

1 


tgcaagaaagacagaagctcttooagtcotcagcaatgtcccctttgcat 


cDNArat 


1351 
1 


gaaccccaggatctctaaaggcagaccctttgctatggt&coatctggag 




1401 
1 


ctttcctatgtacaaagccaaccattgatccatcactgaagtoaaagago 




1451 
1 


ctggttactcaggaggacaatggatctgcctccacctcacctcaagattt 




1501 
1 


catagaaocxjtttggctocttgtctttgaacatgacananntntctggaa 


cDN\rat 


1551 
1 


ataaggccgacatggtctgtagtatcoaaaagccatcaaggacatcacca 


geae£ta_3ni 


ieoi 
l 


ectgcattcactgaagaaaatgactacatcatgotaaatgcgtcattttc 




1$51 
1 


cacaaatcttgtgtgcagtgtagattataatcacatccagccagtgtggc 


cDNAjrat 


1701 
1 


aaottctggctttatacagtgactctcctctgatactagaaaggaagccc 


cD8\rat 


1751 
1 


cagcttaccgagactccttcactgtcttctagatataaaoaggtggotot 


<JDNA_jrat 


1801 
1 


taggcctgaagacatttttaccagcatagaggctgatgtcagagcagacc 


yonoaiic_ha 


1$51 
1 


ctttttggttccaacaagaaaaa&ttgtcttgcagctgaacagaactgcc 




1$01 
1 


accacacttagoacattaoagatocagttttcoactgatgctcaaatcgc 


cDNA_rat 
genomie_lra 


1951 
1 


tttaccaagggcggagatgagagcggagagacticaaatggacoatgatoc 
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□ 

vl3 ■ 

!,fl 
HI 

*5 



ru 

S'U 

n 
ru 



g>enoaia Jtca 

g^noaicMhu 
g«*no£i.c_hu 
getwiicjtiti 
g<w*oiic_fau 

«S3MXxafc 
g^OOLicJhu 

gonomic^lm 

g«aom±c_iiu 

g^nomic^lm 

gonoaLi.c_hTi 

g'Q&OSli.Q llH 

ganojaicjhu 

CDNArat . 
gonoaiic__bti 

oON&xat 



2001 
1 

20S1 
i 

2101 
1 

2151 
1 

2201 
1 

2251 
1 

2301 
1 

2351 
1 

2401 
1 

24S1 
1 

2S01 
1 

2S51 
1 

2601 
1 

2651 
1 

2701 
1 

2751 
1 

2801 
1 

2S51 
1 

2901 
1 

2951 
1 



tgatgatgaacaatcoaaaaotggaacgoaatgtoctggfctggcggcact 
attgccotgagctgtooaggoaaaggogaoccttcacctcacttggaatg 
gcttctagctgatgggagtaaagtgagagccccttacgttagogaggatg 
ggcgaatcetaatag&caaaaatgggaagttggaactgcagatggctgac 



agctttgatgoaggtotttaccaotgcataagcaccaatgatgcagatgc* 



ggatgttctcacatacaggataactgtggtagagcccta.tggagaaagca 



cacatga.cagtggagtcc«gc«cacagtggttacgggtgagacgctcgacj 



cttcoatgcctttccacgggtgttccagatgcttotattagctggattct 
tccagggaacactgtgttctctcagocatcaagagacaggcaaattctta 

acaatgggaccttas.gaatattacaggttacgccaaaagatoaaggtcat 

taccaatgtgtggctgccaaccc^tcaggggccgacttttccagttttaa 

agtttcagttcaaaagaaaggccaaaggatggttgagcatgacagggagg 

caggtggatctggacttggagaacooaactccagtgtttcccttaagcag 

ooagcatctttgaaactctotgoatcagctttgacagggtcagaggctgg 

aaaacaagtctecggtgtacataggaagaacaaacatagagacttaatac 

atcggcggcgtggggattccacgctccggcgattcagggagoataggagg 

cagctccotctctctgctcggagaattgacccgcaacgctgggcagcact 

tctagaaaaagccaaaaagaattctgtgccaaaaaa^caagaaaatacca 

cagtaaagccagtgccactggctgttcccctcgtggaactcactgacgag 

gaaaaggatgcctctggoatgattcctccag&tgaagaafctcatggttct 
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genomi<:__hTi 


3001 
1 


gaaaac*^aggcttctggt$rtcooaggaaggtcaccaactgctgactotg 




3051 
1 


ga ccagta aa teat ggttttatgaogagta tagcttctggcacagaagt c 


ct>KA_rAt 


3101 
1 


tceactgtgaatocacaaacactaoaflLtctgagcaccttcotgatttcaa 


g^okicTjbtt 


3151 
1 


attatttagtgtaacaaacggt«cagctgtgacaaa^agtatgaacccat 


SGnoaicJfcu 


3201 
1 


ccatagoaagcaaaat«igaagatacaaooeLaooaaaacccaatcattatc 


<CK\_rat 
gaaoaio_iia 


1 


tttccatcagtagctgaaattcgagattctgctcaggoaggaagagoatc 


<£JKX_rat 


3301 
1 


ttcccaaagtgcaoacoctgtaacagggggaaacatggctacctatggce 


gaaoolajtw 


33S1 
1 


atacoaacacatatagtagctttaccagcaaagccagtacagtcttgcag 


<DHfc_rat 


3401 
1 


ccaataaatccaaoagaaagttatggacctcagatacctattacaggegt 




3451 
1 


cagcaga cot agc«gtagfcgacat ctct tctcacactact goagacec t a 


csDKfcrat 


3S01 
1 


gcttctcoagtoacccttcaggttcacacaccactgcctcgtctttattt 




3S51 
1 


oacattcctagaaacaacaatacaggtaacttc<x>ottgtccaggcactt 




3601 
1 


gggaagagagaggacaatttggagcagagggagagttaaaaacccacata 


cDNAjraf; 
g«oottic_lra 


3fiSl 
1 


ga a ccccagtt ctccga cggcatagacaoaggactgt gaggecagcaa tc 


gaiiomiaJhTi 


3701 
1 


aaggga cot gctaacaaaa a tgtgagccaagt tccagccacag&gt accci 


genomic hu 


3761 
1 


tgggat gt gccacacat gtcct teegcaga 9 gggctcacagt ggot aefcg 


cDNAjrat 
genonicjux 


3€01 
1 


cagcactgtcagttccaagttcatcccacagtgooctccc:<iaaaactaat 




3851 
1 


aatgt tggggt ca tagca gaagagt ctaccactg tgg tcaagaa accact 


<sDNfc_rat 


3901 
1 


gttactatttaaggacaaaoaaaatgtagatattgagataataacaacca 


genomicj*"! 


39S1 
1 


c taca aa at a 1 1 ccgga ggggaaagtaaccaogt gat to eta eggaagea 


cDN\_rat 


4001 
1 


agca tg ac 1 1 c tgc t cca acat ctgtat ccc t ggggaaa tct cc tgtag a 




4051 
1 


caatagtggtcacctgagcatgcctgggaccatccaaacfcgggaaagatt 


d>NA_rat 
gGnomio_hti 


4101 
1 


cagtggaaacaacaccacttcccagccccctcagoacaccctcaatacca 
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3 n 



3 



gotiosti c__H"U 

<SDS&xat 
goao£ic_lm 

cONAjrat 

cDHk_rat 
gonoal-cjbn 



4151 
1 

4201 

1 

4251 
1 

4301 
1 

4351 
1 

'4401 
1 

4451 

1 





4S01 




1 




4551 


g^noaicjhu 


1 


cDH&xat 


4601 




1 


cDNA_rat 


4651 


gefx>miejbtt 


1 




4701 




1 




4751 


geaoaicjhu 


1 




4601 




1 


cDNA_rat 


4851 




1 




4901 


gsnoaicjhu 


1 


cDHA_rat 


4951 




1 




5001 


genoaic^hu 


1 


<3DKArat 


5051 




1 


cDNArat 


5101 


genomic Ira 


1 


cDKA_rat 


SlSl 


^©noSto Ira 


1 




5201 


geaoatic im 


1 



aeaagca<»aaattctca&agaggaaaactccottg«ocagatctfctgt 

aaataaccagaagaaggaggggatgttaaagaatccatatcaattcggtt 

tacaaaagaacccagccgoaaagottcccaaaatagctcctottttaccc 

aoaggtcagagtteeccctcagattctacaaotctcttgacaagtecgoc 

accagctctgtctacaacaatggatgccactcagaacaagggcactgaag 

tagtatcaggtgccagaagtctctcagcagggaagaagcageccttcacc ■ 

aaotoototccagtgcttcctagcaccataagoaagagatctaatacatt 

aaacttcttgtcaacggaaacocccaoagtgacaagtcetaetgctactg 

catctgtcattatgtctgaaacccaacgaaeaagatecaaagaagoaaaa 

gaccaaataaaggggcctcggaagaacagaaacaacgcaaacaccaoacc 



~??!!!?!! tct99C ! atastffo * tactcagctct « 8caaca ?otgata 

coccottggctttcagtcattcoccacgacaagatgatggtggaaatgta 
ff!?^ ^5 ettatcactcaac aao«tcfc>=ttctggcoataactgaact 
™!_?!f!!^!ff CCCagacttt939aaataca&ca 3 cttt S5«a«caa 

!!!!!!!!!! c !!! cctcagoagtS3g9C?ccccca ? t30coactcct tc 

!!!!!!!!!" ttaCtaag5gtgt9srttacagaca 5caaagtcacatcag 
!!^f!?f!?!!?! Caa !"f a5tsffftcac<satatat 9 a *tct:tcaagg 
-^^^ff^f^f^'^^^^aggctagccccaatootga 

?!-!f!!!!!! ga !! cactgactctocctctaatct 9 tttc «tc<:actt 

f*?!fff! g f!^!!^ aSa ^ aaac ^ aca ^ attcfca *atggaagcco 
!!!^!!? gCCagaaCaCaaatatCa5Ctcaa 5 tcat a<:tccg aa a C cat 
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cDK&_rat 5251 tgagaagggcaaaaggccagcagt&agc&tgtccccccacctcagccttc 

9^noalo__fea 1 — — >« — — — ™ 

c©K\jcat 5301 cagaggccageactcatgcctcacactggaatacacaga.agcatgcagaa 

genoaie_hu 1 : - 

cOK^rat 5351 aagagtgtttttgataagaancctggtcaaaaoocaacttccaaacatct 

_genoaic_hu 1 — — — — — — -™™~ .--l— — «- 

cDNA^rat 5401 gccttacgtctctctacctaagactotattgaaaaa^ccaaga&taattg 

gono*^e_hu 1 — ——™— « 

<£S&jcat 5451 gaggaaaggotgcaagctttacagttccagctaattcatfaogtttttctt 

cSa^xat 5501 ccttgtgaggotgttggagacccactgcccatcatccactggaooa^gt 

goaoaicjax 1 — — - — -~ - 

cBH^rafc 5551 ttoatoagganttgaaatatcccaagggacacagaaaagccggrttecacg 

geivciic^Jiv. 1 ~< — — — ^ 

cDKA^rat 5601 tgcttcccaatggcaccttgtc^tccagagggtcagtattcaggaccgfc 

g^noftic^lro 1 — * 

^SJKK^xat 5651 g^racagtacctgtgctctgcatttaatocactgggcgtagaccattttca 

geaoaie__lwi 1 — ■ * * 

cdtt^xat 5701 tgtotctttgtctgtg^ttttttacacggcaaggattttggacagacatg 

g^nomi «_Hu 1 ~- « — — , 

<$Kkxafc 57t>a rcaagga^rcsueaeagfttcactttggaagtactgtgrgaactaaagtgcaga 

grenosaicjva 1 * 

dDHAjrat 5801 gtggagggtatgccgaggcctacggtttcctggatacttgcaaaccaaac 

^aoai.c_liu 1 

dDKKx&t sesi - ggtggtctcagaaacggccaagggaagcagaaaggtctgggtaacacctg 

g^noaiicjiu 1 " 

cDK\jrat 5501 atggaacattgatcatctataatctgagtctttatgatcgtggtttttac 

genomlo^txtt 1 • 

cDK\_rat 5951 aagtgtgtggccagcaacccatctggccaggattcactgttggttaagat 

gsnomio__tra 1 — - — " -™* 

€&xt\_ x^t €001 acaagtcatcacagctcccoctgtcattatagagcaaaagaggeaagcca 

g^noaic__hu 1 « 

cDHk_r&t €051 tcgttggggttttaggtggaagtttgaaaotgocotgcactgcaaaagga 

g^noaic^tJ 1 . 

dDHA rat €101 actccccagcctagtgttcactgggtcctttatgatgggactgaactaaa 

ye»omio_hu 1 *" 

cBHA_rat €151 accattgcagttgactc^ttccagatttttcttgtatccaaatggaactc 

g6aoai.cJte 1 * — * ' 

cDM& rat 6201 tgtatataaga^gcatcgctcctt<^gtgaggggcacttatgagtgcatt 

g^nomi q_ku 1 

cDK^x&t 6251 ^rccaccagctcctcaggctcagagagaagggtagtgattcttactgtgga 

gan-canic Ihj 1 — ~ — - — 
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cDNA_*at 6301 agagggagagacaatccccaggatagaa&ctgcctctcagaaatggactg 

g^nomicjhn 1 — - — 

oDNA_rafc aggtgaatttgggtgagaaattactactgaactgctcagetactggggat 

gaaosiic _hu 1 ■ 

^0K\jrat €401 w^aagcctagaataatctggaggcl^ccatccaaggctgtcatogacca 

genomicjhn 1 • — — - — — — — — — 

cDNA_r&t €451 gtggca ca gaaTOGGCftGCCGftATCCACGTCTACXXIAAATGGATCCTTGg 

gwoaicjiu 1 . ~ - -TGGGCAGCTGGATCC^CC^CTACCCTAATGGATCCCTGt 

COHfcMt €501 tggttgggTCAGTGACG<3AAA^(^C(^TGGTG^CTACTTATGTGTGGCA 

gmealGjxa 40 ttattggaTCAGTAAGAGAAAAAGAC^GTGOTGYCTACTTGTGTGTGGCA 

oDUkjeat €551 AGAAACAAAATGGGAGATGACCTAGTCCTGA^ 

gaaooicjra 90 AGAAAC^UUUlTGGGGGATGATCTGATACTGATGCATGTtagcctaAGACT 

cDKAxsit 6601 GACACCTGCCAAMTTGAACAGMGCAGTATTTTAagMGCAAGTGCTCC 

genomicjui 140 GAAACCTGCCAAMTTGACCAC^GCAGTATTTTAgaAAGCAAGTGCTCC 

eDHA_r»t. €€51 ATGGGftAAGATTTCCAAGJTGACTGCAaggcctctOGCTCCCCTGTGCCT 

•geoe&iojui 190 ATGGGAAAGATTTCCAAGTAGATTGCAaagcttccGGCTCCCCAGTGCCA 

cfiN&jcet. €701 GAGGTATCCTGGAGTTTGCCTGATGggacagtgcXCAACAATGTAGCCCA 

^^aoaiejw 240 GAGATATCTTGGAGTTTGCCTGATGgaacoatgaTCAACAATGCAATGCA 

eDHAjrat €751 AGCTGATGACAGTGGCTATAGGACcaagaggtacACCCTTTTCCACAATG 

genomicjiu 5 90 AGCCGATGACAGTGGCCACAGGACt aggagat atACCCTTt TCAACAATG 

cPNAjcat €801 G^CXTTGTATTtTCAACAACCTTGggatggoaGAGGAAGGAGATTATATC 

gano5i£:_>a 340 GAACTTTATACTTCAACAAAGTTGgg^tagcgGAGGAAGGAGATTATACT 

cDKA rat €€51 TGCTCTGCCCAGAJfiC^CCTTAGGGAAAGA^ 

genomicjw 390 ?GC2ATGCCCAG&ACACCCT^^ 

cDKA rat €901 AGTTCTMCAGCcatcCCACGGATAAGGCAAAGCTACmGACCACCAT^ 

g6no5ic_hu 440 AGTTATAACAGCtgctCCCCGGATAAGGCAGAGTAACAAAACCAAC^AGA 

oDNA_rs.t €951 GGCTCAggGCTGGAGAAACAGCTC^ 

genomifcjm 490 GAATCAaaGCTGGAGAC^CAGCTGTCCTTGACTGTGAGGTCACTGGGGat 

<SDWA_ra.t 7001 cogaago<3caatg^ATTTTGGTTGCTGCC^TCCAAQaat:gtcAT$TCATT 

^j«no»loJbti €40 c<?<^aaocaaMatAYTTTGGTT<^TGCCTTCC%AtgacatgATTTCCTT 

oDHA xat 7051 CTCCAATGACAGGT TCACATTTCA? GCCAATagaaCTTTGTCCATCCATA 

$£AOftieJro 590 CTCCATTGATAGGTACACATTTCATGCCAATgggtCTTTGACCATCAACA 

CDNA_rat 7101 AAGTGAAACCACTTGACTCTGGGGActatgtgtgcgtagctgagAATCCT 

g€inomie_hu €40 AAGTGAAACtGCTCGATTCTGGAGAgtaogtatgtgtagcccgaAATCCC 

cDNA_rat 7151 AGTGGGGATGACACTAAGACATACAAACTGGAcattGTCTCTAAACCTCC 

genomicjiu €90 AGTGGGGATGACACCAAAATGTA^AJUVCTGGAtgtgGTCTCtAAACCTCC 

cDNAjrat 7201 ATTAATCAATGGCCTGTATGCAAACAagACTGTTATTAAAGCCACAGCca 

genomic Jhu 740 ATTAA^CAATGGTCTGTATACAAACAgaACTGTTATTAAAGCCACAGCtg 

cD^xat 7251 ttcggcacTCCAAAAAATACTTTGACTGCAGAGCAGATGGGATCCCATct 

genoaicjbti 790 t gagacat TCCAAAAAACACTTTGACTGCAG&GCI GAAGGGACACCAT ct 

dMSkxvt 7301 tcccaggtcacGTGGATTATGCCAGGCAATATf^TCCTCCCAGCTCCATA 

genomic hu €40 cctgaagtcatGTGGATCATGCCAGACAATATT^^CCTCACAGCCCCA'TA 
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cDKA_rat 7351 CttTGGMGGAGAGTCACGCTC^TCCW^TOGAACC^SHStSAgatga^ga 

g^cmiefctx 890 Ct a? GGAAGCAGAAT CACAGTCCATAAAAATGGAACCTTGGAaat t agg a 

cDKA_3rat 7401 acatccGGCTXTC^^CTGTGCGCMTTCACCTGTGTGGttcggagcgag 

g^noaicjbu 540 atgtgaGGC^TTCAGATTCAGCCGACTTTATCUGTGTQGccogaaatgaa 

cDHA_rat 7451 ggaGGAGAGA(^GTGTTGGTAGT0CAGTTAGA^GTCCTAGAAA1!GCTGAG 

genomiejHl 990 ggtGGAGAGAGCGTGTTGGTAGTACAGTTAGAAGXACTG^AAftTGCTGAG 

cDNA_rat 7501 AAGACCAACATTCAGAMCCraTTCAACGAA^^ 

ganc*i<sjiu 1040 AAGACCGACATTTAGAAATCCA^2AATGAAAAaatagttgcccagctgg 

oDHA rat 7551 gcaagcccgtAGCACTG^TCK^^ 

STdnoiioJxu 1090 gaaagtccacA^CATTG&ATTGCTCTCMGATGOTAACCCACCACCTSSA 

cDNArat 7601 at tacCTGGATGTTA<XTGACG<XACACXGTTt GCTMCAG^CCAcaca a 

g«ao»ioJw 1140 ataatCTGGATCTTACCAAATGtKl^ 

. eDHA_rat 7651 ttccccGTATCTGATGGCAGGCAATGGCTCTC^ 

genoJdcjbu 1190 ttatcaGTATCTGATAGCAAGCAATGGTTCTTTTATCATTTCTAAAacaa 

cDKAjrat 7701 ctcggaacaagtCAGGGAAGTATCGCTGTGCAGCCAGGAATAAGGTTGGC 

genoaicjfcvu 1240 ctcgggaggatgCAGGAAAATATCGCTGTGCAGCTAGGAATAAAGTTGGC 

oDHAjt&t 7751 ^AcatcgagaaactcatcctgttagagAOTGGGCAGAAGCCAGTCATTCT 

gftnoaie^Sitt 1290 TAtattgagaaattagtcatattagaaAT^GGCCAGAAGCCAGTTATTCT 

<£>HA xat 7801 GACATACGAACCAGGGAtggtgaagagcgTCAGTGGGGAACCGTTATCAC 

g^noalcjiu 1340 f ACCTATGCACCAGGGAoagtaaaaggoaTCAGXGGAGAATCTCTATCAC 

cDKAjcat 7851 TGCATTGTGTGTCTGATGGgATCCCCAAGCCAAATG^CAAGXGGACTACA 

genomic^bu 13 90 TGCATTGTGTGTCTGATGGaATCCCTAAGCCAAATATCAAATGGACT ATG 

CsDKA rat 7901 CCGGGTGGccATGTAATCGACAGGCCTCAAGTGGATGGAAAATACATAcT 

^noAi^Jhu 1440 CCAAGTGGttATGTAGTAGACAGGCCTCAAATTAATGGGAAATACATAtT 

cDKA_rat 7$5l GCAT GAAMTGGCACGCTGGTC^TCAAAGGAACAACAGCt cacgaccaAG 

9«MBie>» 1490 GCATGACAATGGC&CCTTAGTCATm 

cfflafcjcat 8001 GAAATTATATCTGTAGGGCTCAAAACAGTGTTGGCCAggcagttattagc 

ge&MicJra 1540 GAAACTATATCTGTAAGGCTCAAAATAGTGTTGGTCAtacactgattact 

tiDM&jrmt 8051 g t gt CAGTGATGGTTGTGGCC?ACCCTCCCCGAA!T CATAAAc t a c ctAC c 

g^moaicjw 1590 gttcCAGTAATGATTGTAGCCTACkXITCCCCGAATTACAAAtogte^CC 

<5DNAjcat 8101 C^GGAACATGCTCAGGAGGACAGGGGAAGOCatgCAGCTCCACTGTGTGG 

gmonic^lrc 1640 CAGGAGTA^TGTCACCAGGACftGGGGC^GCCtttCAGCTCCACfGTGTGG 

cDNA_xat 81S1 CCTTGGGAATCCCCAAGCCAaAAGTCACCTGGGAGACGCCAAGAGACTCC 

g«nottic_b» 1690 CCTTGGGAGTTCCCAAGCCAgAMTCACATGGGAGATGCCTGACXACTCC 

cfcNArAt 8201 CTGCTCTCAAaagcaacagcaagaaaacccCATAGAAGTGAGATGCTTCA 

genoaiejiu 1740 CTTCTC^CAAcggcaagtaaagagaggaoaCATGGAAGTGAGCAGCTTCA 

cDNA^rat 8251 CCCACAAGGTACGCTgGTCATTCAGAATCTCCAAACCTCGGATTCCGGdg 

ganoalojiu 1790 C1?TACAAGGTACCCTaGTCATTCAGAATCCGCAAACC!TCCGATTCTGGga 

cDHfc_rat 8301 tcTATAAGTGCAGAGCTCAGAACCTACTTGGgacTGATTACGCAACAACT 

ganoalejro 1840 taTACAAATGCACAGCAAAGAACCGACTTGGtagTGATTATGCAGCAACG 

cDHAjrat 8351 TACATCCAGGTACTCTGACAGGAAgggggagactaaaattcaacagaagt 

g<s*xdatic_im 1890 TATATTCAAGTAATCTGACATGAAataataaagtcaacaacatctgggca 



23/90 



g^noaicjm 1940 gaa --TTTAT!TCmt?GAAGAAGTTTAAT^ 

dm rat- 6451 TAGGCATCTAAATGAgt cTGAATACATTTACAGTAT T AAATTTACAATGG 

g«na^ejm 1970 TAGGC&TGTAAATGM t tTGAAT ACATT TACAGTATTAAATTTACAATGA 

cBNfcrat $501 ACATGCgatga GACTTGTAAATGAAAGCATTGTGAACTGAa accg 

y^noaic^hu 2029 ACATCCaaaataaaagGACTTGTAAATAMTGCATTAT^ 

<sDKA B rat 6546 agtctctg T<^TC^CAAAGCAAACTCTXAA<^MGGCACTTTg 

gsawaticjui 207P ctgatttatttaaTGGASCTCAAAACAAACT^ 

cDNA rat 8591 MmGCCAACAAATAA^C^ 

gen<?^icjbu 2129 AT^TCCCAACAAATAACAATAAACAaacattgaaacggtt CACTA 

c£>HA_rat S641 CGAAAIAACAAACGOCTAATGCACCTGAAT^ 

^zioSiqJrtl 2175 ?A^2iUlCAAATGGCTAA^ 

cOtf&x&t 6*91 tctcgctaac^gttgC<^GCT<KX^COTO?CTGTTTCCTACCAATGTCAC 

ganosdcjm 2225 ttctaata CCAGTTGCC^STX^CCACCT^ 

cDKA «tfc 6741 AAACAtogcacacagggtGAATGGAGTCAAC£GGAAA 

gon<mic_hti 2260 AAGCAtggcactca GAACAGAGACAATGGAAAATATTAAATCTGCA 

cDKA_rat 8791 GTCtgtgtaaatctca ATGTACAAATA^TCTGtcncTG 

g«n<micjiu 2314 ATCtttatgatgtaaatttaccatcctgATGtATAAATAtTTTG TG 

exonl (2342-2397) 

oDKA_rat 8791 GTCtgtgtaaatctca ATGTACAAATATTCXGtcncTG 

SK*noaic_hu 2314 ATCtttlitgatgtaaatttaccatoctgATGTATAAATAm'TG-- XG 

cDKA_rat «829 GTTTATAAACMMT<^TAAAACCOAAAAAAAAAAAAAaaaaaaaAaaaa 

ffUMOiQjia 2360 GTTTATAAATTTTT TTGCTAAAACCTACAGAAAAT AAGca c t g aa ct gt c 



(Genomic human OCP: SEQ ID NO:4) 
(cDNA rat: SEQ ID NO:5) 
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Figure 1 1 
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608 Human translated nucleotide sequence (ORF) 

atgaaggtaaaaggcagaggaatcacctgcttgctggtctcctttgctgt * 

gatctgcctggtcgccacccctgggggcaaggcctgtcctcgccgctgtg 

cctgttatatgcctacggaggtacactgcacatttcggtacctgacttcca 

tcccagacagcatcccgcccaatgtggaacck:atcaatttaggatacaac 

agcttggttagattgatggaaacagatttttctggcctgaccaaactgga 

gttactcatgcttcacagcaatggcattcacacaatccctgacaagacctt 

ctcagatttgcaggccttgcaggtcttaaaaatgagctataataaagtcc 

gaaaacttcagaaagatactttttatc^ 

CACATCKjACCACAACAATATTGAGT^ 

gctcaactttctccgcctggtgcacttggaaggaaatcagctcactaagct 

ccacccagatacatttgtctctttgagctacctccagatatttaaaatctc 

tttcattaagttcctatacttgtc^ 

gagatggtctcctatatgcctgacctagacagcctttacctgcatggaaa 

cccatggacctgtgattgccatttaaagtggttgtctgactggatacagcc 

agatgtaataaaatgcaaaaaagatagaagtccctctagtgctcagcagt 

gtccactttgcatgaaccctaggacttctaaaggcaagccgttagctatg 

gtctcagctgcagctttccagtgtgccaagccaaccattgactcatccctg 

aaatcaaagagcctgactattctggaagacagtagttctgctttcatctct 

ccccaaggltrcatggcaccctttggctccctcactttgaatatgacagat 

cagtctggaaatgaagctaacatggtctgcagtattcaaaagccctcaag 

gacatcacccattgcattcactgaagaaaatgactacatcgtgctaaata 

cttcattttcaacatttttggtgtgcaacatagattacggtcacattcagc 

cagtgtggcaaattttggctttgtacagtgattctcctctgatactagaaa 

ggagccacttgcttagtgaaacaccgcagctctattacaaatataaacag 

gtggctcctaack:ctgaagacatttttaccaacatagaggcagatctcag 

agcagatccctcttggttaatgcaagaccaaatttccttgcagctgaaca 

gaactgccaccacattcagtacattacagatccagtactccagtgatgct 

CAAATCACTTTACCAAGAGCAGAGATGAGGCCAGTGAAACACAAATGGA 

CTATGATTTCAAGGGATAACAATACTAAGCTGGAACATACTGTCTTGGTA 

GGTGGAACCGTTGGCCTGAACTGCCCAGGCCAAGGAGACCCCACCCCACA 

CGTGGATTGGCTTCTAGCTGATGGAAGTAAAGTGAGAGCCCCTTATGTCA 

GTGAGGATGGACGGATCCTAATAGACAAAAGTGGAAAATTGGAACTCCA 

GATCK3CTGATAG*mTGACACAGGCGTATATCACTGTATAAGCAGCAATT 

ATGATGATGCAGATATrCTCACCTATAGGATAACTCTGGTAGAACCTTTGG 

TCGAAGCCTATCAGGAAAATGGGATTCATCACACAGTTTTCATTGGTGAA 

ACACTTGATCTTCCATGCCATTCTACTGGTATCCCAGATGCCTCTATTAGC 

TGGGTT'ATTCCAGGAAACAATGTGCTCTATCAGTCATCAAGAGACAAGAA 

AGTTCTAAACAATGGCACATTAAGAATATTACAGGTCACCCCGAAAGACC 

AAGGTTATTATCGCTGTGTGGCAGCCAACCCATCAGGGGTTGATTTTTTGA 

TTTTCCAAGTTTCAGTCAAGATGAAAGGACAAAGGCCCTTGGAGCATGAT 

GGAGAAACAGAGGGATCTGGACTTGATGAGTCCAATCCTATTGCTCATCT 

TAAGGAGCCACCAGGTGCACAACTCCGTACATCTGCTCTGATGGAGGCTG 

AGG1TGGAAAACACACCTCAAGCACAAGTAAGAGGCACAACTATCGGGA 

ATTAACACTCCAGCGACGTGGAGATTC/VACACATCGACGTTTTAGGGAGA 

ATAGGAGGCATTrCCCTCCCTCTGCTAGGAGAATTGACOCACAACATTGG 

GCGGCACTGTTGGAGAAAGCTAAAAAGAATGCTATGCCAGACAAGCGAG 

AAAATACCACAGTGAGCCCACCCCCAGTGGTCACCCAACTCCCAAACATA 

CCTGGTGAAGAAGACGATTCCTCAGGCATGCTCGCTCTACATGAGGAATT 
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TATGGTCCCGGCCACTAAAGCTTTGAACCTTCCAGCAAGOACAGTGACTG 

CTGACTCCAGAACAATATCTGATAGTCCTATGACAAACATAAATTATGGC 

ACAGAATTCTCTCCTGTTGTGAATTCACAAATACTACCACCTGAAGAACQC 

ACAGATTTCAAACTGTCTACTGCTATTAAAACTACAGCCATGTCAAAGAA 

TATAAACCCAACCATGTCAAGCCAAATACAAGGCACAACCAATCAACATT 

CATCCACTGTCTTTCCACTGCTACTTGGAGCAACTGAATTTCAGGACTCTG 

ACCAGATGGGAAGAGGAAGAGAGCATTTCCAAAGTAGACCCCCAATAAC 

AGTAAGGACTATGATCAAAGATGTCAATGTCAAAATGCTTAGTAGCACCA 

CCAACAAACTATTATTAGAGTCAGTAAATACCACAAATAGTCATCAGACA 

TCTGTAAGAGAAGTGAGTGAACCCAGGCACAATCACTTCTATTCTCACAC 

TACTCAAATACTTAGCACCrCCACGTTCCCTTCAGATCCACACACAGCTGC 

TCATTCTCAGTTTCCGATCCCTAGAAATAGTACAGTTAACATCCCGCTGTr 

CAGACGCTTTGGGAGGCAGAGGAAAATTGGCGGAAGGGGGCGGATTATC 

AGCCCATATAGAACTCCAGTTCTGCGACGGCATAGATACAGCATTTTCAG 

GTCAACAACCAGAGGTTCTTCTGAAAAAAGCACTACTGCATTCTCAGCCA 

CAGTGCTCAATGTGACATGTCTGTCCTGTCTTCCCAGGGAGAGGCTCACCA 

CTGCCACAGCAGCATTGTCTTTTCCAAGTGCTGCTCCCATCACCTTCCCCA 

AAGCTGACATTGCTAGAGTCCCATCAGAAGAGTCTACAACTCTAGTCCAG 

AATCCACTATTACTACTTGAGAACAAACCCAGTGTAGAGAAAACAACACC 

CACAATAAAATATTTCAGGACTGAAATTTCCCAAGTGACTCCAACTGGTG 

CAGTCATGACATATGCTCCAACATCCATACCCATGGAAAAAACTCACAAA 

GTAAACGCCAGTTACCCACGTGTGTCTAGCACCAATGAAGCTAAAAGAGA 

TTCAGTGATTACATCGTCACTTTCAGGTGCTATCACCAAGCCACCAATGAC 

TATTATAGCCATTACAAGGTTTrCAAGAAGGAAAATTCCCTGGCAACAGA 

ACTTTGTAAATAACCATAACCCAAAAGGCAGATTAAGGAATCAACATAAA 

GTTAGTTTACAAAAAAGCACAGCTGTGATGCTTCCTAAAACATCTCCTGCT 

TTACCACAGAGACAAAGTTCCCCTTTCCATTTCACCACACTTTCAACAAGT 

GTGATGCAAATfCCATCTAATACCTTGACTACCGCTCACCACACTACGACC 

AAAACACACAATCCTGGAAGTCTTCCAACAAAGAAGGAGCTTCCCTTCCC 

ACCCCTTAACCCTATGCTTCCTAGTATTATAAGCAAAGACTCAAGTACAA 

AAAGCATCATATCAACGCAAACAGCAATACCAGCAACAACTCCTACCTTC 

CCTGCATCTGTCATCACTTATGAAACCCAAACAGAGAGATCTAGAGCACA 

AACAATACAAAGAGAACAGGAGCCTCAAAAGAAGAACAGGACTGACCCA 

AACATCTCTCCAGACCAGAGTTCTGGCTTCACTACACCCACTGCTATGACA 

CCTCCTGCTCTGGCATTCACTCATTCCCCACCAGAAAACACAACTGGGATT 

TCAAGCACAATCAGTTTTCATTCAAGAACTCTTAATCTGACAGATGTGATT 

GAAGAACTAGCCCAAGCAAGTACTCAGACTTTGAAGAGCACAATTGCTTC 

TGAAACAACTTTGTCCAGCAAATCACACCAGAGTACCACAACTAGGAAAG 

CATCATTAGACACTCCCATACCACCATTCTTGAGCAGCAGTGCTACTCTAA 

TGCCAGTTCCCATCTCCCCTCCCTTTACTCAGAGAGCAGTTACTGACACAC 

GTGGCGACTCCCATTTCCGGCTTATGACAAATACAGTGGTCAAGCTGCAC 

GAATCCTCAAGGCACAATCTCCAAATGCCAAGTTCACAATTGGAACCACT 

CACTTCATCTACCTCTAATCTGTTACATTCTACTCCCATGCCAGCACTAAC 

AACAGTTAAATCACAGAATTCCAAATTAACTCCATCTCCCTGGGCAGAAT 

ACCAATTTTGGCACAAACCATACTCAGACATTGCTGAAAAAGGCAAAAAG 

CCAGAAGTAAGCATGTTGGCTACTACAGGCCTGTCCGAGGCCACCACTCT 

TGTTTCAGATTGGGATGGACAGAAGAACACAAAGAAGAGTGACTTTGATA 

AGAAACCAGTTCAAGAAGCAACAACTTCCAAACTCCTTCCCTTTGACTCTT 

TGTCTAGGTATATATTTGAAAAGCCCAGGATAGTTGGAGGAAAAGCTGCA 

AGTTTTACTATTCCAGCTAACTCAGATGCCTTTCTTCCCTGTGAAGCTGTT 
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GGAAATCCCCTGCCCACCATTCATTGOACCAGAGTTTCAGOACTTGATTTA 

TCTAGAGGAAACCAGAATAGCAGGGTCCAGGTTCTCCCCAATGGTACCCT 

GTCCATCCAGAGGGTGGAAATTCAGGACCGCGGACAGTACTTGTGTTCCG 

CATCCAATCTGTTTGGCACAGACCACCTTCATGTCACCTTGTCTGTGGTTT 

CCTATCCTCCCAGGATCCTGGAGAGACGTACCAAAGAGATCACAGTTCAT * 

TCCGGAAGCACTGTGGAACTGAAGTGCAGAGCAGAAGGTAGGCCAAGCC 

CTACAGTTACCTGGATTCTTGCAAACCAAACAGTTGTCTCAGAATCATCCC 

AGGGAAGTAGGCAGGCTGTGGTGACGGTTGACGGAACATTGGTCCTCCAC 

AATCTCAGTATTTATGACCGTGGCTTTTACAAATGTGTGGCCAGCAACCCA 

GGTGGCCAGGATTCACTGCTGGTTAAAATACAAGTCATTGCAGCACCACC 

TGTTATTCTAGAGCAAAGGAGGCAAGTCATTGTAGGCACTTGGGGTGAAA 

GTTTAAAACTGCCCTGTACTGCAAAAGGAACTCCTCAGCCCAGCGTTTACT 

GGGTCCTCTCTGATGGCACTGAAGTGAAACCATTACAGTTTACCAATTCCA 

AGTTGTTCTTATTTTCAAATGGGACTTTGTATATAAGAAACCTAGCCTCTT 

CAGACAGGGGCACTTATGAATGCATTGCTACCAGTTCCACTGGTTCGGAG 

CGAAGAGTAGTAATGCTTACAATGGAAGAGCGAGTGACCAGCCCCAGGA 

TAGAAGCTGCATCCCAGAAAAGGACTGAAGTGAATTTTGGGGACAAATTA 

CTACTGAACTGCTCAGCCACTGGGGAGCCCAAACCCCAAATAATGTGGAG 

GTTACCATCCAAGGCTGTGGTCGACCAGTGGAGCTGGATCCACGTCTACC 

CTAATGGATCCCTGTTTATTGGATCAGTAACAGAAAAAGACAGTGGTGTC 

TACTTGTGTGTGGCAAGAAACAAAATGGGGGATGATCTGATACTGATGCA 

TGTTAGCCTAAGACTGAAACCTGCCAAAATTGACCACAAGCAGTATTTTA 

GAAAGCAAGTGCTCCATGGGAAAGATTTCCAAGTAGATTGCAAAGCTTCC 

GGCTCCCCAGTGCCAGAGATATCTTGGAGTTTGCCTGATGGAACCATGAT 

CAACAATGCAATGCAAGCCGATGACAGTGGCCACAGGACTAGGAGATAT 

ACCCTTTTCAACAATGGAACTTTATACTTCAACAAAGTTGGGGTAGCGGA 

GGAAGGAGATTATACTTGCTATGCCCAGAACACCCTAGGGAAAGATGAA 

ATGAAGGTCCACTTAACAGTTATAACAGCTGCTCCCCGGATAAGGCAGAG 

TAACAAAACCAACAAGAGAATCAAAGCTGGAGACACAGCTGTCCTTGACT 

GTGAGGTCACTGGGGATCCCAAACCAAAAATATTTTGGTTGCTGCCTTCC 

AATGACATGATTTCCTTCTCCATTGATAGGTACACATTTCATGCCAATGGG 

TCTTTGACCATCAACAAAGTGAAACTGCTCGATTCTGGAGAGTACGTATG 

TGTAGCCCGAAATCCCAGTGGGGATGACACCAAAATGTACAAACTGGATG 

TGGTCTCTAAACCTCCATTAATCAATGGTCTGTATACAAACAGAACTGTTA 

TrAAAGCCACAGCTGTGAGACATTCCAAAAAACACTTTGACTGCAGAGCT 

GAAGGGACACCATCTCCTGAAGTCATGTGGATCATGCCAGACAATATTTT 

CCTCACAGCCCCATACTATGGAAGCAGAATCACAGTCCATAAAAATGGAA 

CCTTGGAAATTAGGAATGTGAGGCTTTCAGATTCAGCCGACTTTATCTGTG 

TGGCCCGAAATGAAGGTGGAGAGAGCGTGTTGGTAGTACAGTTAGAAGTA 

CTGGAAATGCTGAGAAGACCGACATITAGAAATCCATTTAATGAAAAAAT 

AGTTGCCCAGCTGGGAAAGTCCACAGCATTGAATTGCTCTGTTGATGGTA 

ACCCACCACCTGAAATAATCTGGATTTTACCAAATGGCACACGATTTTCCA 

ATGGACCACAAAGTTATCAGTATCTGATAGCAAGCAATGGTTCTTTTATCA 

TTTCTAAAACAACTCGGGAGGATGCAGGAAAATATCGCTGTGCAGCTAGG 

AATAAAGTTGGCTATATTGAGAAATTAGTCATATTAGAAATTGGCCAGAA 

GCCAGTTATTCTTACCTATGCACCAGGGACAGTAAAAGGCATCAGTGGAG 

AATCTCTATCACTGCATTGTGTGTCTGATGGAATCCCTAAGCCAAATATCA 

AATGGACTATGCCAAGTGGTTATGTAGTAGACAGGCCTCAAATTAATGGG 

AAATACATATTGCATGACAATGGCACCTTAGTCATTAAAGAAGCAACAGC 

TTATGACAGAGGAAACTATATCTGTAAGGCTCAAAATAGTGTTGGTCATA 
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CACTGATTACTG1TCCAGTAATGATTGTAGCCTACCCTCCCCGAATTACAA 

ATCGTCCACCCAGGAGTATTGTCACCAGGACAGGGGCAGCCTTTCAGCTC ' 

CACTGTGTGGCCTTGGGAGTTCCCAAGCCAGAAATCACATGGGAGATGCC 

TGACCACTCCCTTCTCTCAACGGCAAGTAAAGAGAGGACACATGGAAGTG 

AGCAGCTTCACTTACAAGGTACCCTAGTCATTCAGAATCCCCAAACCTCC 

GATTCTGGGATATACAAATGCACAGCAAAGAACCCACTTGGTAGTGATTA 

TGCAGCAACGTATATTCAAGTAATCTGA 
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Figure 14 
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r*t_cDNA 

mas etJNA 5 



GAA3XX7TAk!EAjGACftAAA^ 
G<Sl1NXTAA!I&GACaAAAG^^ 



fatmSnJ) +3_corrected 

nuis c&SA 5 



6TCTTTACC7y^TGGA3^7V^ 



rat_CC&3A 

huaan - 5-f-3_correated 
ncus cGNA 5 



ans c5ka 5 



ratjCCNA 

human 5 + 3_corrected 
mas ct&IA 5 



G<^TTCTTCCAGG$AACftC^^ 
G<3GTTATTCCAGGAAACAA1 , GTGCTCTATC^ 
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hts&sin_5+3__co rr ected 

WIS 5 



AT^GGACCTTAAGTMATTAC^^ 



huiaan_5+3^cQrreoted 



XatjaDNA 
mas cDNA 5 



hraSn__5+3 corrected 
Boa cDHA 5 



TTGCTCATCTTMG^ 



rat^cDNA 
was cDNA 5 



AGGC£GGaAA&3U^^ 
AjGGTTGOAA&A(^CACCro 



ratjcDKA 

hu&anj5 +3_cor reot ed 



AGCGACGTGGAGAm?^ 



xatjbOKA 

fcunSn_5+3_co rreajted 
mus cDttA 5 



CPGCTCGGAG^TGAC^^ 



xat_cOfcl& 
tCUS cONA 5 



C^TGCCAAAA?&0CA&3AA^ 



rat^cC^A 

h\fflSn^5^3_corrected 



mis cDKA 5 



mis cBNA 5 



rat_c£$3A 

huxaan^5+3_correoted 

c5NA S 



CACKAATACEACC^CCTG?^ 



huroan_j5+3_corr*<3ted 



CAGCG?V3X5TCAAAGA&T^A 



bunja&__S+3_corr0Oted 
mn 5 cDNA 5 



AGXAGCT Qi^^TTCGAGATTCTG -<TT 



AAAWXX^TCA^AlXTrrCCATC- 
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ratjcDNA CftSOCft GGJ^&GCATCTTCOC^^ 

uuis cosa 5 — - — — - — — ~- 



mus cDNA 5 — — — — < ~ «— — . ~ > 



hu«Sa w 5+3jcorrected 



humari_5+3_corrected 



ret^cDNfc 

husSn_5 +3_oo rrected 
sms dm 5 



mus cSt3& 5 



ACTICm3TTGlXX&GGCa^^ 



mis CONS. 5 



AAAACXXaCATAGAAOC^ 



fcWm_5+3_corr*cted 
mis cDNA 5 



law cDNA 5 



ratjcDNA 

&unw_5+3_jcorreoted 



ratjcDNA 

htnaaii_5+3_corr€cted 



CAACTCTW^CCAGAMCC&C^^ 



rat_cWA 

huipen_54-3^_correot©cl 



CAGTC^TGACAmTQCTCCAACATCCAIACC^ 



rat_cP«A 
mas cDKA 5 
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mis d5s& 5 



braan_5+3_corrected 

was cVOXh 5 



Itts8anJ5+3_corrected 



humaj&_5 +3_corrected 

ecus crm s 



rat_cDNA 



hu»a&_5 +3_corrected 



ratjcDl*A 

htm5n_5+3_co rrec ted 
snis cDNA 5 



AAn^CTCCTACCTTCCCTGCATCTCT 



rat_cDNA 

human_5+3_corr^c t«d 



rat_cDN& 

htjraan_5+3_correct*d 



ratjcCNA 



huzaan_5+3_oorrected 
inns cDl^A S 



ratjcDNA 

huniaji_54-3_corrected 
mu6 cOKR 5 
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icmnaii^54*3_corrected 



AATGCXA<&TCCC3m3K^^ 



C&Ci 5 



ATCAScrraxiAi3?m»^^ 

TCCCfcTTTCmSOTtATGA^^ 



KtiinanJS+3_corrected 
szus cDNA 5 



TACAGATCTGCAGC?^^ 

TOCNNHNNNNCA — AATGCCAAGTTCA CHHA^C^GAfcCO^ 



ratjdftJA 
ittus dDNA 5 



HACOTOmZDACfreCtteti^ 



ratjcDNA 

hWan_5+3_oorrected 



bunian_S+3jcorr©ct«<i 
mas cDNft 5 



AIACTCAGACRTTGC!^^ 



h^aSo_5+3_correcte^ 
stag CDNA 5 



^TTOCAOAOGCCAGCACr^ 
GOCTGKXSQAGGCCAtXLftl^^ 



BHia oDKA 5 



GTGACTTT<amA<^AJCC^ 



huaan_5+3_<» reacted 



crcTAccxaaGacrc^^ 



rat_cDNA 

woe cdba 3 



CTATTCCRGCTAkaKa^ 



ratjcDKA 
bus cDNA 5 



CXMTCAf3K3GACC^^ 



ratjdDNA 



GGTT<XACQTGCTT^^ 
GGGTCCAGGTTCTOXXmT^ACCCtGTC^ 



ratjcDNA 

humaii_jH-3__co rrected 



human_5 +3_corrected. 
XSXX& CPNA 5 



CTGTGGTTTTTTACCC^^ 
CTGTGGTTTCCTATCCTCCCA^ 
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said CU&A 5 



rat_CDN& 

husS^5+3_corrected 



hus»n^_S+3_corrected 



hT3fflaai^5+3_corrected 
mus <M 5 



O^GCTCCCCCTGTCATTOT^ 



hutt&n_5+3jcorrected 



hiiniaa_5+3_corrected 
3ttus cDHA 5 



human_5+3_corrected 



ratjCDNA 



CTRCCftSTrCCftC^^ 



rat_cDKA 

hunto_5+3_cor*eoted 
«U5 c&lA 5 



hnfflanJBtSjcorrectM 
CDKA 5 



ht3man_5+3_CO greeted 



WU0 cDNA 5 



GATCCCTGTTTAT?<3GATC&^^ 



feu3aan,,S+3_corrected GAAW^AAATGGGSGA*^ 

KUS_cr>K7V_5 ' — — " ' 



38/90 



rat_cDNA 

humaa_5 +3__correate<i 



-mus oCWA 5 



^GCAAAGCTTCCGGCtCCCC^ 



h«nSi_5+3_oorrectecl 
mis cOKA 5 



rat__cDfc3A 

humaA_5+3_corrected 
UluS COMA 5 



^uman_5+3jcx>rr6Cted . 



CCTM"<3GCGAj[mGACCCX^ 



rat_cDSA 



rat_cDtJ& 
aus COKA 5 



rat_CDNA 
urns CD&k £ 



mus cDKA. S 



ratjODNk 

humafc_5+3 corrected 



hwSii_5+3_corr«cted 



human_S+3__cor r ected 



huEoan_5+3 — corrected 
nraS cDKA 5 
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rat_cCtt& 

fcinnan_5+3_corrected 
xhus cENA S 



GAG(^<$?U^GTGTGT^^ 



ratjcD&Sk 

hwan_5+ 3_correc ted 
was cDNA S 



rat_cDNA 

human_S+3_correctcd 



octctgtggm<jgoaacccccc^ 
gctct^t^ggtaao^^ 



ratjODNA 

human_5+Sjoorrected 



ratjoOHA 

huaan_5 +3_co r r ec ted 
aue cDNA 5 



h3cnaan_S+3j corrected 



xat_cDNA 

humarL_5+3_corrected 
arc* cDNA 5 



CaGGGACAGTftAAAGGCaXC^ 



rat_cc*iA 

buiPM_5+3__corrected 
ttm cdha s 



rat__d>t*A 

huiaan_$+3 corrected 
ma? COKA 5 



TCSAITCGAAAATACATACr^ 



rat_cDNA 

hn&an_5+3_corrected 
mas cBHA 5 



rat_ct>KA 

human__5+3_ccrrected 
JSUS oDNA 5 



TTCG^AAIG^5?GTAGC^^ 



rat_cDNA 

iiumaii__5+3_^corr«cted 
nuis 5 



^CCAG<^CAGGGGCAGC^^ 



ratjODNA 

h\mian_5+ 3_co r r ec ted 
xcols cDNA 5 



AAATC^CATGGQWSm^XTGRCCA 



rat_cDNA 

bunxa»_5+3_correct«d 
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hux?5n__5+3_coxrecte<J 
mis cDHA 5 



iat^nan_5+3_oorrecte<3l 



AmTX<^Gm^^<3AG?VTGAA itf£AIARAGT-<AAC»A-- CATCT5GGCA 



ratjcPNA 

hua&nj5+3_corrfccted 



r&tjCDRA 

human_5 +3_cor rec ted 
wis cENA 5 



M!ACATTTAC2^&raAAA3Tl3^^ 



fcOTan_5+3__ corrected 
TO* cDNA 5 



GCATTGTGAACTGM ACCGAGTCTCTG — TGGiVJCrcAAAGCAA^CTCTTMCTTM 

GGATTATtSAACI^^ 



mi* cEWA 5 



GGC&C^TTASOTTSCCAAC^ ACATTGAAACGGXTC7^CTAT 



rat_cDHA 
mis c5na s 



AAAATAACAAATGGCI^ TGAACTT- CTAAT 



ratjdDNA 

huixi5a_5+3_coxrected 



A CX2tfyrT(^CTAGTGTCCACCT^ A 



nus cDNA 5 



ATG<2USTCAACGG<3A^ 



rat_cC&A 

feura*n_5+3_corr€ct*d 



hunm_5+3 — corrected 
bus cDttA 5 



(rat_cDNA: SEQ ID NO:7) 
(human5+3 corrected: SEQ ID NO:8) 
(mus_cDNA_5: SEQ ID NO:9) 
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Figure 15 



huaxan_5+3_corrected KKTKC^GrTC^VSratflC^VM'PGG^ 
aouse 5 corrected MQKSGKEV$a^$XTAlCL^ 



rat I03ft£yKSIirRI,T^g^^ 



rat £KDTFtGl«G^V£LBLDHNlU 
iaouse_5_corrected EXDaMGLmnOBUSB^^ 



human 5+3_oorrected im$FXKTLYi£r*iFLTSiiPQEfc^ 
kbous e~5_corr eC ted IFKJSFIKXLYLYDNF-TSLFmiV^ 



rat IKCm^SSSPQC^LOmp^SKI^ 

mouse $ corrected . . — 



aotise_5 corrected — — — ~ — 



rat lO<VCSro¥HHIQFVKOIr^ 
huroan_5+3_corrected FivcNiDYGHiaPWQx;ua»rs^^ 

ttOttse_$_corrected — — — — 



rat imJ®Fm?Q&2KIW&^ 
buxi^_$+3_corrected D^sirapswiJiQDQisi^i^^ 

*OUSe_5 corrected - — — « Z 



rat PSO^HTVLV^TIiU^tt^D^ 

aouAe 5 corrected »■ — — — — 



r*t , Al^FIft^YHCI$TNO&I^ 

huaaa_$+3_corr«ct«d ABSFDTGVmJISSNXDI^ILT^^ 

»ous* 5 corrected — ™— — • — — — 



rut PD^ISWILF(5Wra^P$R^ 

joouse 5 corrected — — ^ 



rat SVOKKGO£MVEHDEEM(^(^^ 
huc^_5+3jCGrr«»cted SVKMKGQRPLEHIX^TEGSGLt^ 

mouse 5 corrected — ■* — 



bOT»nJS+3jcorrfccted YRELTLQRRGDSTHRKEl^JNRRHFPPSARRr DPOHWAALLg KAKK^J&CKRENTTYSPP 
Hioiise 5 Corrected ♦ - ™ — 
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hui^_5+3jcorrected tOTTQLFNlFGKEDDSSGtll^^ 

»QVLse~5 corrected — 



rat EVS-TVHPQTLQSEHl£t>^ fiV 

huran_5+3_corrected EFSPVVNSQILmEHJDrarSXfclKT^^ 

©OUS e_5_corrected — — — < — 

rat *2a3tf^QAQlAS--$0$A^^ 

mouse's corrected — — ™— — — — — 



rat mGV$WSSSDISSHXXAl»?SFSSHPSGS^ 

saou$e~5 corrected ~ — — _ 



rat IlfSR^VS&JEERtmaitHB^^ 

l&Ottge 5 corrected -« 



rat TAALSVPSSSH&ujPK^^ 

saotise*5 corrected — — ~— — ™ — < 



»OW«e 5 corrected — 



rat X$S&8B3MBmX&&m^ 

©ouse""5 corrected — — ~ 



rat mSSFS&U&TSlMXCNBS^^ 
hiaasn_S+3jcx>rrected TISlSVtfQIPSWOTTAHHT^^ 

mouse 5 corrected - • > 



rat 2£!EfKMflrSraSA9V^^ 
human_5+3_corrected ISTQlAlFflTTPTn&SVlTYETQ^^ 

»rose~"5 corrected 



rat S^TTADTnAFSHSPRQDIX?(RrV'3aVjftYHSTTS — LIAITja^KrTOTLGNTTMxETT 
htnaanj5*3jcorrected TSMTT — PAIA5THSPPENTO3I $5TI SFHSRTI^LTD^EBLAQJVSTCn'I^TIASSTT 
moused corrected — - * , ; r 



rat I£SK$QESTTVKRAS-m^-pm<$e^ 
human_5+3_corrected LSSKSHQSTTTRKASJ^PIPPFLSSSA^^ 

jbous^Ts corrected ____ 



rat VTm^fcHNXBI l gQe&«^^Pm 

&«n©n_543_corrected VKLH£S$$Btf--X$3K£SSC&EP J.TSeTSl^IJCSTPMH^TTVKSQN^KLT^SP 

aouse 5 corrected 



rat ygZHKYQX^^SSTIEKGKRPAVSMSFB^ 

kura&n_5+3_corrected WAE1QFSiniF3?YSDIA£K^ 

mouse's corrected — — - ^ . 
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mouse~5 corrected — - — 



hman_5+3jeorrected DLSSm^NSRVQVtrPNGTI^^ 

toovies 5 corrected — 



mouse 5 corrected — — — — , ™™— ^— 



rat LSX,YPi^mCV2^S<^ 
human J 5+3_corrected LSIxmOFyKCVASUP^ 

mouse 5 corrected ~— — «- : 



rat F0*$vmVLYlX3mtftfI&Ll!HSRm 

aaoue e__$_co rr ected. — — — . 

rat VILTVEEGETIPRIETkSOKim^ 

human_5+3_corrected VMLTH^lOTSPRIimSQXRTEVKFGD^^ 

xrcmse""5 corrects : 



rat GSRlHVYPNC^WGSVTEKDfta?^^ 

ttOtts^_5 fc oorr^ct^d 

rat VIEG^WX^K^SFTO^ 

wuse 5 corrected — ^ * — ™ 



rat GXAE^Yl&JmttJ^^ 

s&ouse S corrected ~m~m— 



rat imVWHXSmnSZSMXtFTFB^^ 
hm^_5+3_corraet*<l KPKIFWIJtfSNiraSmOKraFHAHC^ 

aousft 5 corrected — 



*ousfe 5 corrected — — . — 



rat VTVHPNGTLEKBtfllU^DSJ^ 

mous«~"5 corrected ■ — 



rat QAGKPVM^lKSVDGNPPPEIT^^ 
fcuman_5+3_corrected QXX^CS7AI^CSVDGNPJPEIIWII*PNOTJ0P$m 

jrou?^ 5 corrected . — _ 



rat RCAARNXVGYI EKLI LLE I GQKF HXT YE£K$MVKSVSG£ PLS LHCVS DG I PKPNYICNTTP 

iiuman_5 +3_correc ted RCZ^USU^GXIEXLVTLE^ 

tstause 5 corrected • 
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rat <3OTfi^^Vp<3KYILHEH^ 
humn_5+5_corrected SCTWDRPQIHSsanira^ 

jaouse_5_correcte<J ; — » — 

rat xr^tttrnw^B^om^isc^am 
rat imm&svs<m8&&^^ 

iiOTan_5+3_correeted LVI^IPQTSDSGI YKCTAKNPLGS KOUITYTQV 
nous e_5_corr ected , — 

(rat: SEQ IDNO.10) 
(human_5+3_corrected: SEQ ID NO: 1 1) 
(mouse_5_corrected: SEQ ID NO: 1 2) 
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Figure 16 



rat 

taaan 5+3 corrected 



rat 

human 5+3 corrected 



rat 

h\jman S+3 corrected 



rat 

^ human_S+3_corrfectftd 
husaan 5+3 corrected 



rat 

hvat&n 5+3 corrected 



a htonan_5+3__co rrected 

!'4 sat 

Ilj fcxsasiiJ5+3 fc Corrected 

U rat 

I H human 5+3 corrected 



rat 

human 5+3 corrected 



rat 

tmflvm 5+3 corrected 



rat 

huaau_5+3_corrected 



rat 

human 5+3 corrected 



rat 

Uxmran^S+^^corxected 
rat 

fcu»aaj5+3 corrected 



rat 

human 5+3 corrected 



*«*•** *. 

********* ^** *,** - ** : ************* » ^*****^ **«************. * 
QKD1OTGI^SLTRI£MDfiNNiEFINPEYFXGliN 

******** **^*************** t ****^ ********* t **************** 

*** ****j *j **********«.*******£* ;*****Jf««* «*****«***•«•* **• 

******** ^**^********** »**.*.*** - ^** *.***** w ***.**** t **«♦ 
** *** a *« ********** A**.*.************* ***************** 
***_ j**,*******-*************^ 4«**4 4 r ,***** ********** 

*.«*** * : *;;* ir****;****.********-****.*******^ . ***** ** 
.***; ******** % * *** 5**5 3 ***************** fc******^ ******* 

*****-*.*****.* ************** *+ ♦ ; r*» ******** ***• 

******** J***^*. * 4 ***« * ******************.* J ********^** *3* 
** : **** ; *** * ; ****,*,*. # j* :*** 

******* *****|**» i* ********^ ************ j*^******* # * 

HAVPIiVJ&TDEEKBASG^PP*^ 

*;.* * i * ****** ■*** **** **** 

BVS-r^Kffl^EJO^ra^FSVTNCT — SV 

*-* .**.* * .**** : . : ;** 

YEFQDSDQM(^GR£HFC^RP — LLESVNTTNSfi — QT 

* T" * _ ** *J* 4 * „ » ^ « **- + ^+ «*. ** *.*• * 
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rat 

buaan 5+3 corrected 



rat 

humans +3_cor rect ed. 



rat 

husian 5+3 corrected 



rat 

tiu2^^5+3_corrected 



rat 

human 5+3 corrected 



rat 

human 5+3 corrected 



rat 

human 5+3 corrected 



rat 

huaianjj +3_correct ed 



rat 

human 5+3 corrected 



rat 

httman^5+3__corrected 
rat 

human 5+3 corrected 



rat 

huroan_5+3__correated 



rat 

human 5+3 corrected 



rat 

human__5+3_corrected 



rat 

human 5+3 corrected 



rat 

human 5+3 corrected 



rat 

human 5+3 corrected 



♦ * **„* ^ ^ : **** * * ; * * * *.** * ;: ** ; *^ 
* IflSi^KVKNPHRIFVI£BfiRHm 

* , **** .+* ********* j ♦*#Jt*««t « ,** * j* * ** * 

EAMSHTS^IEFPKADIWVPSEEST^ 

****** **j- ;s**i: . * } *****.**.****; :: * ^.* ** * 

** * ** ****i. s * : j « «* :j :*** * ; ,*_ %jl< * .* 

TSTKFSKMCTPIH<UFV^QKKE<3MI^ 

3 «;**.** * i* *.* JJ„***.,*^ *** : * ** **** ^ * 

** ** . , **.; *: **, .*:*** *** * - * 

i 

SnTOUPATTPTFPASVmET^ 

**:*♦ .**** *** **« « * : * i*** * * •* * **.., ♦ 

aMTP — PAIAFTHSFPEHTTGISSTIS FHSRTL^lWIEEUAOaSTQfrLKSTIASE TTL 

*!*• ..***«*** * * .* ** ; **** .* * **** 

SSKSHQStmK^Dmmi^SATI&PmS^ 

U** *** ** : ***.*, .* 4 * ^**** 2 .***.. * * M ,* ** 

TI¥E$ SSJ£tfTDLQQP$AEASPKPEI ITGTTDS PSNI^PSTSVPiU^KVDi^OHSKWKPSPW 
KLHESSJ&N — LQMPSSQLEP LTS STSNlJiES!ITMPALTTVKSQNSKLTPSPW 

„ ;;*****♦ ** **. ; <+ *^***j **^.*** ^ *.**** ,**** 

****** *** # * 3 ** ; .,** ***** 

TSKLLPFDSI^mPEKPRlV^ 

*** **s **,t 2 ::****j******** •*****„******** j*** ******* * ; 

j*;*,*-**^, ************ ^ *********** * : »_** ? ** ? **** **^*** ; « 

HVKEITVHFGSrVELKCRVEaiPjaWSir^^ 
RTKEITVHSGSrmKCBAEGRPSPTVTWII^ 

; ,****** **********„** * ***.*********** u j***;. ** ****:: 
SLYDRGFYKCVASHPSCSQDSI^^ 

* ; ************^ 4i *****^**** 1t:1t1tit4 ^ :+tj ^^^ Jt ^ ^ t+^nm+a 
OPSVHWVLYMTELPCPIiC^THSRFFLYPNGTL 

«**+,*** ****.*+**,*,*, ^ ***********.*****★* 

MLTHmWTSPRIEAASOKRTS^^ G 

* **** : **** * 
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human 5+3 corrected swiHVYPNGSLFIGSVTEKDSGVIt^ 

— - — * ********* ^ J ******* J* *************;**** ***^****; «****-*** 

human 5+3 corrected IilGKDFQVDC^<^P\re£lSfl^ 

~ *~ **************************** H** « ******•**•*****;*******;** 

rat MM^OTCSaQim^KEEI^^ 

human 5+3 corrected VM£GD¥TC2£jC£OT<3CDEM 

~~ — } ****** * ****************j** ****** ** # *• ;**$ ************* 

rat PNVFSOliPSHmSFSNK^ 

human 5+3 corrected SiamLPSflPtiieFSIERroSH^ 

~ * ; : *****«* : .**** **j***** .*«*;*** ******************* **«*- 

rat VSKPPUM^ONKT^ 
hitman 5+3_COrrected " VSKPPIJHGLYTNB^KA^ 

*" *********** 3 *. a******-*****.******* **.;* »«**,****,,***;***« 

rat TVmi<3TLEi4^1U,3DSJ^ 

htn»an_5+3_corrected TVHKNOT»EIIttn/J&SDSAD^ 

*** *****.**•******** *****************************; ; *± 

rat AGKFVW^SVIK^^ 

huaaa S+3 corrected mksi^C$V1X^PP££IIWII^^ 

*** **^ ************** ****;**•*;« **j*^***-*< **** ; ^.**** 

rat CftfiBKJWG^EKCtllJ^ 

hwnan_5+3 corrected caftSJ^OTEKI*VII£lG$!CF^ 

*************..************ ** **;*****************•*** * 



rat 

human 5+3 



rat 

human 5+3 



GHVIDRPQVDGltfrLHKHG^ 

corrected Gxvvv^m&xn&nmtv^^ 

*«*«****- •******!******* *****;****** J J 5*t* - **«** + ***r* 

corrected mPPRSIVmG*^I£CVM^^ 

* *** *$********«***;«*** * *****^*j „ - , * ** ** **** 



rat 

human 5+3 



corrected VICm^SDSOtm^AKNFI^D^^TYIQVI 
**** ********** *j* **-***.*****< 

(rat: SEQ ID NO: 13) 
(human_5+3_corrected: SEQ ID NO: 14) 
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Figure 17 

MQHIGPJBVSCIXISLTAICLVVTPGSRV^ 

VNLGYNSLTRLTENDFSGI^IILELLMLHSNGHIRVSDKTFSGLQSLQVLKMSYNKVQIIE 
KDTLYGLRSLTRIJfflJC>HNNIEFTNPEAF5f GLTLLRLVHIJBGNRLTKI.HPDTFVSLS YLQIF 

Kl , SFIKXLYLYDNFl\SIJ^KEMVSSMPNLF^LYT.HGNPWTCDCHLKWI^EWMOGNP 
(SEQIDNO:15) 

Figure 18 

MKVKGRGITCLLVSFAVICLVATPGGK^CPRRCACYMPTEVHCTFRYLTSIPDSIPPNVE 

RINLGYNSLVT^LMETDFSGLTKLEIJJVILHSNGniTIPDKTFSDLQAL 

LQKDTFYGLRSLTRLHMDHNNIFJ^PEWYGI^^ 

QIFKISFIKFLY1^DNFLTSLPQEMSYMPDIX>SLLYLHGNPWTCDCHLRWIJSD 

KCKKT)RSPSSAQQCPirMNPRTSKGKPLAMVSAAAFQCAKPTIDSSLKSKSLTILEDSSS 
AHSPQGFMAPFGSLTLNMTDQSGNEANMVC^IQKPSRTC^ 

VCMDYGHIQPVWQE.ALYSDSPLILERSHLCSETPQLYYKYKQVAPKPEDIFTNIEADLR 
ADPSWIJS1QDQISLQLMITATTFS 

LEHTVLVGGTVGLNCPGQGDPTPHVDWLLADGSKVRAPYVSEDGRILIDKSGKLELQM 

ADSFDTGVYHCISSNYDDADILTYRITVVEPLVEAYQENGIHHTVFIGETLDLPCHSTGIP 

DASISWVIPGNNVLYQSSRDKKVIJSfNGIlJlELQVTPKDQGYYRCVAAN^ 

SVKMKGQRPI^HDGETEGSGLDESNPIAHLKEPPGAQLRTSALMEAEVGKHTSSTSKRH 

NYRELTLQRRGDSTHRRFRENRRHFPPSARRIDPQHWAAIJLEKA.K^ 

SPPPVVTQLPNIPGEEDDSSGMLALHEEFMWATKALNLPARTVTADSRTISDSPMTNIN 
YGTCFSPVVNSQILPPEEFTOFRXSTAILTTAM 

ATEFQDSDQMGRGREJ^QSRPPrTWTMIKDVNV 

SEPRHNHFYSHTTQn^TSTFPSDPHTAAHSQFPIPRNSWNffLFRRFGRQRKIGGRGRn 

PYRTP\T^JRRHRYS1FRSTTRGSSEKS1TAFSATVLNVTCIJSCLPRELTTATAALSFPSAAPI 

TFPKADIARVPSEESTTLVQNPIXLLENKTSVEKTTPTK^ 

SIPMEKTHKVNASYPRVSSTNEAKRDSVITSSI^GAITKTPMTnAITPJSRRKffWQQNFV 

NNIiNPKGRIJ^NQTlKVSLQKSTAVMLPKlSPALPQRQSSPfflFRRLSl^SVMQIPSNTLri^ 
AHHTTTKTHNPGSLPTKKEIPFP^^ 

TQTERSRAQTIQREQEPQKKNRIDPMSPDQSSGFTTPTAMTPPAIAF^ 

ISFHSRTLNLTD V1EELAQ ASTQTLKSTIASF^I^ SKSHQST1TRKASLDTPII 3 PFLS S S AT 
LMPWISPPFTQRAVTDTRGDSHFRLMTNTVVI^^ 

IISTPMPAElTVKSQNSKLTPSPWAEQIWILKPYSDIAEKGKKPEVSMLATrGI^EATTLV 

SDWDGQK^TKKSDroKKPVQEAri^SKLLPFDSLSRYIFEKPRIVGGKAASFriPANSDAF 

LPCEAVGNPIJ>TIHWTRVSGLDLSRGNQNSRVQVLPNGTLSIQRVEIQDRFQYLCSASNL 

FGTDHLHVTI^VVSYPPRILERRTmTVHSGSTVELKCRAEGRPSPTVTWILANQTVVSE 

SSQGSRQAVVTVDGTLVLHNLSIYDRGFYKCVASNPGGQDSLLVKIQVIAAPPVJLEQRR 

QVIVGTWGESLKLPCTAKGTPQPSVYWVLSDGTEVKPLQFTNSKLFLFSNGTLYIRNLAS 
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SDRFITECIATSSTGSERRVVMLTMEERVTSPRffiAASQKRTEVNFGDKLLLNCSATGEP 

KPQMRIJ'SKAVVDQGSWfflYPNGSLnGVTEKDSGVYLCVARimiGDDLlLMHVSLR 

LKPAKIDHKQYFRKQVLHGKDFQVDCKASGSPVPEISWSLPDGTMINNAMQADDSGHR 

TRRYRIJ^NGTLYPMCVGVAEEGDYTCYAQNTI/jKDEMKVHLTVIT 

KRIKAGDTAAVLDCEWGDPKPKIFWLL^ 

EYVCVARNPSGDDTKMYKLDVVSKPPLINGLYTNRTVIKATAVRHSKKHFDCRAEGTP 
SPEVMWIMPDMFiTAPYYGSRIT^^ 

VIIMHtRPTFRNPRNPFhffiinV^ 

QYLIASNGSFnSKTTREDAGKYRCAARNKVGYIEKLVIIJEIGQKPmTYAPGTVKGISGE 
SLSLHCVSDGrPKPNlKWTMPSGYVVDRPQINGKYIT.HDNGTLVIKEATAYDRGNYICKA 
QNSVGIiTLITVPVMTVAYPPRITNRPPRSIVTRTGAAFQLHCVALGVPKPEITWEMPDHS 
LLSTASKERTHGSEQLHLQGTLVIQNPQTSDSGIYKCTAKNPLGSDYAATYIQVI (SEQ ID 
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TCn^QAAGTAAAAT^ 

AATCATCTCTCTOTGOTTTOO^ 

ACCTACCGAATACAGAGCTATGACGre^ 

GTAGCTXKX£AGAACATACTAGATGCC^TAT^ 

CTTAAGAATATGGTTGTCTCTGC^^ 

GGTTAGTGCTAACTCTGACTXK^TT^ 

ATC^CTTGGTGAmOCAACTGGAW 

CTCAGCTTTOGCAGCTC^^ 

ACimTICTAOTACAAGACCC^m 

GAAGAOTCAATTAGACATATICTQATGMGTAAGCATATA^ 

TGTlTOATGTATGCICATGGATOCrATrATAAIOT 

AGCTTATGATACTTGTCACAGAGTAAATCTTO 

TTTCATAATCX^TCAAT^ 

CTGCCCTAGGATCKmT^ 

AGCUGCACATCTTCAGAACATCTGCC^^ 

ACICAC^TOTOCTTCAA(X^(^GGGATTAAA 

GGAAGCCTOCTAGAGGAACAGGATGTATCAGGATAA^ 

ATAACCCTC&CC^CAACAMCA^ 

GAACTAAGCCATATCCAGAAAAAGT^^ 

AGTC^TATGCAGAATATAGACAAAGAT^^ 

CTTGTAGATTAGAGGACACAAAGCMC^ 

TATAATATGAGGACTGTATITAATA^ 

ACTAACACACACACTTACATAGAGAGAATAAAGTGATAGATACXjr 

TAGCTGATAATTTCATATTGTACA^ 

AGGTGCCTCIXXXttGGGCGG^^ 

CCACATGGAGGAGCTAGAGAAAGTAGCCAA^ 

TGGAACAAC^TTATGAGCTAACCAGTAC 

AAAAGATGGCCTAATCGGCCATCACIXKJA^ 

ATGCCCCAGTACAGGGGAATACCAGGGC^^ 

GGGGlXKKiTGGATATGGGGGACTITIX}GTATAGCATTGGA 

CTAATAAAAAATGGAAAAAAAAAAAAAAAAAAAAAAAAAGGAAGG 

TTCACTGCTATCTCAACTIXKiAMCAGM 

GATTOAAGCGTCTITGMTGT^ATrGW 

TGTGAAAACTTACGGGGTCTATGAC^TGTTT^ 

ATATTTGGCACCACTAC^G<XCCAGCCCCC^ 

CACACACACACACACACACACAC^^ 

CATCATCTCKTrCACI^ 

CTATTTITATGTCTAGGTlXXXXXritXX^ 

AQAATTTGTAGCCCX3X3A<KX&GCCC<^^ 

CAGGGCTTACCAGCACAGGAGGAGCAAACTeG^ 

CCCGGGTCX}ATGGCCCGCCCATTCC^ 

CXXTCCTCCCGAGTCAAGA 

TTGGCCCAAGGCAGAGGTGAGCGGCTGCTAATCC^ 
AGTCTGGTAGATGTGGGAGCTXKXK3TTCAGG^ 

GGT(X&GGAAAAGGAAAGTACTGGAGGGGAGTTOQGACAAA AGCAGCQACCA AGGGAA 

CATCXK7ITCAOTGA(^AAGCCAGGC 

OyrciTCATAAAC A CTGATGACQTGT^^ 

Q<^ ACGTCXK3 AAA,GAAGGAGTGAT<X}ATTAGTACTTGTA A GTTTAGGTG AQTTTG AGAAC 

TAAgEAACCTATACr^^ 

C^TCAQATAMqqAMQSTmHS^^ 
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TATTGCCAACTCITITGTT^^ 

CAGCTACCTTOTACX^TCXnTGriTAT^ 

ACGCAGAGAGTATTCAAAAATATT(^0^ 

ATCATTAGTAAATCTTAATTTGGGTTAAAOT 

CACTGAGCCTTQ3CCACGTG<^5GGITAAAGA 

CTTOGCCATGCltXCCATC^^ 

AGCAGACAGTGTTATGOCTOC&AAGTC&GAGACT^ 

GIXXrrCCCGGTCATrAGTTGGCTCIXiAAAA 

AGACATGCTAGAAAGAAATrrGACCTrriT^ 

THXKjAAGGITAAGCXJAAATAAATAAAACACATAT^ 

GATCATATAAAGAAGTACArTCATAAATGCAATGTGAAAAAT^ 

TTACTGGTGCAAAGTTriXrrAAATrGCAGATGrACT^ 

AAAATGTATATAAAAGTGTAAAAGGCTCTTGGTC 

TTAATTGTCCCGATAACATTArTATC^ 

AATGAGMGCTATGAAGATATCnTTACMTAATMGCC^ 

GGAAGCnAGCACACACTGGTTTATAAAGAGAACAGTTIXXrrGA 

CTCTAGGGAATAAGAGTIXXTCTTTCIXX^AGATT^ 

TTXrrTOATOATGAG ^TTA TAAGTGGAATAAGATA 

ATGCCCTGAACAATCTTTCAGGTCTT1OT 

TTGGGGGATGGTGTTAATTTrTTTGCAGTTCT^ 

AAACAAACAAAAAACCTCTGAAACTAGAACT^ 

AGAAATCTXK^CAGAATTTATIXSCTACATT 

ACGAACTTACGTAGCCGTCAAAATAIXJAACGGATA^ 

AGAGTOXATOTGGCCATAAAAGAGT^^ 

GG AAATCAATTGGGCTAATCAAAACAAGACAGACT^^ 

CTCTTGACAAAC^GAAGCTAGATTTACACTTGTACG 

TAGTTATACACTATICTAATCTOTGAGTGTCT 

AGCTAGCTCGGGTCKXnAG^^ 

GTAACATAGGTCKXAGGATOAAATGCATrAATTO 

GGCAC^CGTIX^AACCAGGGTGAAATCCCAGCACAGA 

TTCG<XACTAA<XAAGAAGCCATT^ 

CIXX^GTCTGACACTGTGTATAAC^ 

11 1 1 iGT OCTAT TTITCAT A 1'mTlUiTACIOTIT^ 

GCTGTGGTrrTCATTIXjTTTCT^ 

^GOXKJAAACGATCTGGAAGAAGTTOGG^ 

ICAAACAAACAAACAAACAAAAGGTTC&TTTTGCC^ 

2AGTTACGACTCITAAAGAAAATATTCCCA 

^AGGACTTTGCITGAACTGGCC^^ Artificial . 

Noffrfte 

(SEQIDNO:17) 
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cacagaccttccixot^ 
cagcaagcacactctgctaatgc^ 

GCAGACCCACACGtTTXXAACnXKTrc 

GAAACCTXXnrGTGATCTCCAm 

TTCCTCAAACCTTTCCT^^ 

TCATTGGOAACXOQTGCTCCATCCA^ 

CTIGGTCAGTGCAGTXTT^^ 

AACTGTTTTrTGTTATOATTGCTGCAATX^ 

TcraTiOTcrccr^ 

ACttAMQl^TATAGATTCATGGCATK^^ 

ACITXKJCAGGTGTAAMG<X3GG^ 

CTGGAAOmX3G<^TTCCCATAGCT 

GGTCTTGCAIXXnfTTCTK^ 

TAGCAAAAGTATACCTAGAATCXTC^ 

GAAGAAAGTIXKJTAAATTC^TCTGACAAATGGAG 

AGCrAAGTCACACTCATATGCAAGAATTTACCAAGG<XT 

GCTTTATTGATGATAAGGTCTGCTAGAGCAGAAC^^ 

AAAGGTGTAGCATAAATTGTGACTAGAGTOTGAAAOT 

GGCAGAGCTOTTTTATCTrrACTO 

TCCTCTGTTn'GTGACAGATACTICTATG^ 

TirCIGTATTATATMCK^^ 

CCITGTGTCTGTGTCCTT^ 

GTCTrnXXACAGATraAG<KjAGGCCGA(^ 

GCTCACTairGGTCAC^^ 

AAGTGTGGGTCCAGGAAGCAGGTIGTACACCCTGrcT^ 

AAATAAGTTMCCCATCTCrrCCTCX^ 

TGCTGCTGGTGTGTCTCCATC^^ 

TGTC^TTGTTTCTTCAGTTCCTC 

AATCTGGGAATGACTITrCTAAGAAATGCCAC^ 

TAAACAAGATATATGGTGCGCAGTTTATAATCATAAT^ 

CATTCTCATKnXKnTCCACGCTGA 

AAAGTAGTGATTTATGACCTGCrrro 

ATOTTGAAACATGGAACTTGGGGTGACCTO 

THXITCTCCAGAATAAAT^ 

CG<XCGTGGAGTCa^CCATT^ 

ATGTGATlX^fIX£AACTCC^ 

ACTAAGTCTATTGATTTGGAAGATCT^ 

TCAGGCTAAGCTGAAAAATATGACAMTGAAATGTCAA^ 

AGTGTTAGGGGGnTTAAAGAAACAAATACCTACT^ 

CTATTCTAGTITrGAATAATGTTCCXDTAGT^ 

TTATCTTGACX3AATCTCAGTGACTTG^ 

AGGTCTCATTCAACCACATGm^ 

ATItntCCAGAGCACrGOAATTGGGTO 

TGTGCH^AAACAGAGAGTTACAAGACCnrre 

lt>ACAGAATGTCAGAGTOGAGCTCXX^CTGGG 

TTGGAACATC^CACAAAGAAATTCCAAATTATO 

ACTTTTOACTCTTGAACATGAGCITI^ 

GTGAAGGAAAG<mXj<XAACTCCTTmATATGTA^ 

GACAGTATG<XAGTCCCAAGATCTGTC^^ 

AAGGCAGGAGAAGGCAAGCTACAGTGAAGCCCAGTTCACT 

AGATAAGAAGGGTTTCTAATTTTTAAATTTTT^ 

AAATGCTATCCCAAAAGTCCCCTATACTCCCXCCTC 

CCATITCrrG<3KXXrrGGTTAOT 

ACTATCTGGTCTATTGCAAGAATGGCTGCCGAGCIG 

AGCCAAACAGTOGAAGGAACTTOGGGACTOT 
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GCCCCXH3AIXKKK3AAA<X^ 

ACCXTIXXKKKnxn^ 

GCCTCTTCACTCAT^^ 

GATGGQGGTTAGOOOTOO<jK}A1X3<^ 

TGTTCTTACTOAGCTGCCTAGl^^ 

GTAMGACTTCAACnTOT^^ 

TAGGAAOAGAAGA<WAG<XXKJMGACT^^ 

ATGAATAAGAAAAAAAAAATAAAAATAATrAAAAAAAAAAAAGM^ 

TACTCTAAAAOCATIXXSAT^^ 

GATACTAACAATAGATOTAGGAAACTATIGTrAAC 

GAACXnTrmATTGAAGCTTK^ 

TAGCTTAATGATGTTCAGTGAAITAAAMTMCA^ 

AAGGATTCITCGTATATATrrAAAAATCrAAGGl^ 

tttctatatatgtitacrctatcx3tatgttm 

caac&tagtatgtaacttataagaaact^ 

<^accctcagaaacactgggttix3aat^ 

agaaigtaataacctactaaggcaaacatagaa 

gaagtaataaggcaaataatgaatattxxk^ 

gggtgtgattctccagacttgactccai^ 

ttgaaacaatgtgttoaaaataa^ 

tixxtitttraltgckjtqataaggaccatg^ 



(SEQIDNO:18) 
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Hamut 
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Figure 50 

taagccttttgctcccgttggaagcaaagaacgttcctrcaatcaggtgaaggctct 

gctcagaagatttcatgtctcagcttatgttacaagaggattcaaaagcaagacaga 

agagctcaggtatagccaactcttttgttaaatacagtatgaggcttaagtgtacgg 

caactcatgtggtattcatttgcggctctcttctctrataact 

tatagtctctrctgtttcccagctaccttgcaccatctltgtttatctaataata 

agctx:atctgctttttaatcatcacgcagagagtattcaaaaatattcagtgatgta 

acagtgacagtgtaggcatagaagtaatcattagtaaatcttaatatgggttaaact 

cattcataacagctccaggttgg (seq id no:19) 



TTTGGAACCAACCCAGATGCCCCTCAACAGAGAAATGGGCCAGAAAATGTGGTCCA 
TTTATCCAATGGAATACTACTCAACTTATTAAA 

GGCAAATGNATGGTCTGNAGGATCTTGAGTGAGGTAACCCAATCACAAAAGAACAC 

TCATGGTATGCACTCACTGATAAGTGGCTATTTGTtn'ATGGAGTGATTTAAAAGGGA 

AGAAGACACATAGCTTTTTGTGTGTATAATATTAAGATGGAAATTTGCCAGTGCTGT 

TTGGCTTATGAGTGAATCTTGTTTCAGTGGATTACCGGAAGAAAATAATAAGTGAAC 

TGTAGGAAGAAGTAGTTAATCAAGGTGACAAAGTATCCTGACACATTGGGAAAAGA 

CCACAGTCCAGGAAACTGAGTCTTAAGGATTCATATTAACTCCAGTTCCCCATGTGC 

AGCTCTGAGACTTTGGCAGATCAGACACTTAACTTCACCAGCTTCCTACACAGAGCA 

GTTACTATCCTTGCCTTCACACATGGAGTGTGCCATTAAGTGCCTGAACATGAGTCT 

GACTTGTTAATAATCTTTAAAATCCAATTGTGTGTAAAGTATGTGACCAAAGAGCAT 

GGTCATGCTATTAACCTTTGATGTTCT 

ACAAGCTTGTGGAGGCTGGAAGATACAAGGCTTAAGAGGATGGCCTITCAGTTTTG 
AAAGTAATTCAGTATGTGTTCTGGCATCCCTTTTTCCTAAAGCAATTTAACCCCCCAA 
GTAGGCATAATTTTAATOCTTAOT 

GACTTTGGTATGCATAGGATCTAAATGTAAATGTGATTTACTGACATAATAAATAGG 

AGAAACTGAGCTAGAATAGGTATAAAATATGTGCTGGCTTTCTAATAGGTCTTATAG 

GTTATATAAGAGGTGGGAAAGGAATATTTGAAACATCTAGAAGTAAAATGATCCTG 

AGTAGCGATCCTGGGAAAATACGTACTCTAACACACTGCAATCATCTCTCTGTGGTT 

TGCTGGAGCTGAGGTCTGGAAGGCTCGACCTTGGTTAGAAATAACCTACCGAATAC 

AGAGCTATGACGTTAGTCrGGAAGGAGCTTTGGAAGAATGACAAGCTGTAGCTGCC 

CAGAACATACTAGATGCCATATTTCCAAGGCAAGTGTCCACATGCGGACATCTTAAG 

AATATGGTTGTCTCTGCAGTGCTAAGGACCTTGTTCGTGCCACACAGGTCTCCAGGG 

TTAGTGCTAACTCTGACTGCTTGACTCTITAATTCTCCCTTGATCATTAATGACT^ 

AATCACTTGGTGATTAGCAACTGGATATGGAATATTACTTAATTTGTACCCAAGCCA 

GGCCACCTCAGCTITGGCAGCTCCATTCATTCTGTGGAGCCCAGTCACGTGGGTTTG 

AATCAACTGTACTGTITCTACTTACAAGACGCATTACCTGAGATGAGTCATTTTTCTT 

CACAAGTCTTTTTAGAAGAGTCAATTAGACATATTCTGATGAAGTAAGCATATAAAG 
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TGAGAGCAGCATGAATGTGTTCCATGTATGCTCATGGATGCTATTATAATGTGGAAA 

TAAACTGACTTTAAAAAAAAAAGCTTATGATACTTGTCACAGAGTAAATCTTCCATA 

AATATCATCTGCATTTATAAATTATmCATAATCCATCAATTAAAAAC(^TrAGAAA 

TTTTGTTAACACAAAGATCCCTAGGCCCCTGCCCTAGGATGGTCTGTATGGTGGGCC 

TGAGAGATGGAGCTTAAGAACTTACTTGCTCCAGGAGCACATCTTCAGAACATCTGC 

CTCAAAACATTTATCCCAAATGCTCATCAAAGGCTCACTCACATGTGCTTCAACCAC 

AGGGATTAAACAGTCATTTTAGTCACATTTCTCAAACGGTGGAAGCCTGCTAGAGGA 

ACAGGATGTATCAGGATAACATCCAACCTTACAAAAGGATGTCATAACCCTCACCA 

CAACAAACAACAACGACAACAAACCCATAAAAATTATCACGGCAAATGAACTAAGC 

CATATGCAGAAAAAGTATTATATGTTCTCATTGTGGGGTGTTTTTCCTTAATA^ 

ATATGCAGAATATAGACAAAGATGGTrTATGCAAGTGGGGATGGCGAAGGATACTT 

GTAGATTAGAGGACACAAAGCAACAACTACAGAGTGAAGTAATCGAGAGACTTAAT 

GTATAATATGAGGACTGTATTTAATAATTCTATTTAAGATACACAGCAAACGAGTGT 

ATCTTACTAACACACACACITACATAGAGAGAATAAAGTGATAGATACGTTrcTTTT 

ATCTTCATGTAGCTGATAATTTCATATTGTACACCTCAAACATAGATAACCAACAAA 

GAGGAAGAGGATAGGTGCCTCTCCCAGGGCGGAAGAGTACATTCGAAAGTCAGACA 

CCATTGTGTAGATGTACCACATGGAGGAGCTAGAGAAAGTAGCCAAGGAGCTAAAG 

GGATCTGCAACCCTATAGGTGGAACAACATTATGAGCTAACCAGTACCCCGGAGCT 

CTTGACTCTAGCTGCATATATATCAAAAGATGGCCTAATCGGCCATCACTGGAAAGA 

GAGGCCATrGGACTTGCAAACTTTATATGCCCCAGTACAGGGGAATACCAGGGCCA 

AAAAGGGGGAGTGGGTGGGCAGGGGAGTGGGGGTGGGTGGATATGGGGGACTTTT 

GGTATAGCATTGGAAATGTAAATGAGTTAAATACCTAATAAAAAATGGAAAAAAAA 

AAAAAAAAAAAAAAAAAGGAAGGTCAGACACCTCACTTCACTGCTATCTCAACTTG 

CAAACAGAAGGGGAGTCACAAACCCAGGACAAACCACAGTGATTGAAGCGTCTTTG 

AATGTTATTGCTGTTGTTGTTACCACCATCATTAGCATATATTCATTGTGAAAACTTA 

CGGGGTCTATGACATGTTTTTlTATrcAAGTATATCACATGCTGTCAGCATATTTGGC 

ACCACTACCAGCCCCAGCCCCCTTTGCCCCGCCCCCAACACACACACACACACACAC 

ACACACACACACACACACACACACACACACACACACACCTTTACCTTCTCCTGGGCA 

TCATCTGCTCACTCACCCACCCAAGCITAATCCTTTTCCTTCCCTGCAATAGTACCTC 

TCCTATTTTTATGTCTAGGTTCCCCCTCCCCCTGTTAGGAGATGGGAGAGGTCACGA 

AAGAAAGGAATTTGTAGCCCTTGAGCCAGCCCGGGCCACAGAGCCTGCCACCAGAC 

AGGAAAAGCCCAGGGCTTACCAGCACAGGAGGAGCAAACTCGCAGGCGAGCCTGG 

GTTGGCGCTGGTGGTCCCGGGTCGATGGCCCGCCCATTCCCAGAAGCCGAGGCTATA 

GCTGCGTCACCTGCCCCGCCCTCCTCCCGAGTGAAGACCCCTAGAGGCTGAGCAGAC 

CCCAAAGGCGGTGCAATTCCATTGGCCCAAGGCAGAGGTGAGCGGCTGCTAATCCC 

CTCGGGAAGTGAAGGGACCCAGAGAGTCTGGTAGATGTGGGAGCTGGGGTTCAGGG 

CGAGACAGAGGGTGGGATGGGCAGAAGGGTCCAGGAAAGGAAAGTACTGGAGGGG 

AGTTGGGACAAAAGCAGCGACCAAGGGAACATCGCTTCAGTGACTGAAGCCAGGCA 

AAAGGAGCGGGAAGGATTATATGTAGCCTGGGACGCTTTCATAAACACTGATGACG 

TGTTTGTGCAAAGCAAGCAATTTGAGGAGAAACGCCTGGGACGTCGGAAAGAAGGA 
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GTGATCGATTAGTACTTGTAAGTTTAGGTGAGTTTGAGAACTAACTAACCTATACT 
TTGAGGGAGAAGGAAGAGCATTCCAGCAGCAGCAGCAGCAGCAGCAATCAGATAA 
AGGAAAGCTITGGTTAGTTTGGAAATGTATGATACCATTAAAATAACAGAAGCGCCT 
CCAGrrCTCTGAAGAGTCAGlCCCCCAGCTAGTGAAGACTAAGCCrAC'rAAGCCITr 
TGCTCCCGTTGGAAGCAAAGAACG1TCCTTCAATCAGGTGAAGGCTCTCCTCAGAAG 
ATTTCCTGTCTCTGCTTATGTTACAAGAGGATTCAAAAGCAAGACAGAAGAGCTCAG 
GTATTGCCAACTCTTTTGTTAAATACAGTTTGAGGCTTAAGTGTACGGGAACTCATG 
TGGlATrcAllTACGGCTCl'CTTCTCITATAACTAACTCrrAAGGTGCATATAGTCTC 
TrCTGTTTCCCAGCTACCTTGTACCATCTTTGTTTATCTAATAATAGCAAGCTCATCT 
GOTITrAATCATCACGCAGAGAGTATTCAAAAATATrcAGTOATGTAACAGTGACA 
GTGTAGGCATAGAAGTAATCATTAGTAAATCTTAATTTGGGTTAAACTCATTCATAA 
n CAGCTCCAGGTTGGGAGGGATCACTGAGCCTTCGCCACGTGCGGGTTAAAGATAnT 
5 TCTAACAAGAGAAGCAGAATTCTTCCTTGGCCATGCTCCCCATCACTGTGTCAGTAA 
q GCAGAGGGGTGTTTCCAAGCAGAGAAAGAGCAGACAGTGTTATGCCTGCAAAGTCA 
13 GAGACTCAGCCCTCCCAGCTGGTCAGTTTACTGTCCTCCCGGTCATTAGTTGGCTCTG 
jfl AAAAGGCCCATGTGTCCTTATTGGCAAGGACTTGCAGACATGCTAGAAAGAAATTT 
£ GACCirillirCTAGTGGGTTATTACAGCTGTAAAAGTATTTTGGAAGGTTAAGCCA 
| V AATAA ATAAAACACATATTAA ATAATACAATGTTACAAA AATTGATCATATAAAGA 

m AGTACATTCATAAATGCAATGTGAAAAATATATATAATTTTTATCTATTTACTGGTGC 
U AAAGTTTTCTAAATTGCACATGTACTATTTTTATATTTATAAAAATATT^ 
iil TATATAAAAGTGTAAAAGGCTCTTGGTCAAACAAGAGAGTTAAATTTACAAACTTTA 
m ATTGTCCCGATAACATTATTATGATCTCTAATGACAGGGATCCTGCTTTTCATTGGGA 
fU AATGAGAAGCTATGAAGATATGTTTACAATAATAAGCCCATTTAGTGATAAAGTCCA 
Q ATGGGAAGCTAGCACACACTGGTTTATAAAGAGAACAGTTTCCTGAGTCTATGCAA 
ly GTTTACACTCTAGGGAATAAGAGTTCCTCTTTCTCCAGATTTCACTAGCATTTGTTGT 
CATCATTTATCTTCTTGATGATGAGCATTATAAGTGGAATAAGATAGGATCTCAAAG 
GAATGTCAATTTGGATGCCCTGAACAATCTTTCAGGTCTTTCITTCAGTTCACTAG 
TAlTCATTrATTGGATAAITGGGGGGATGGTGGTAATTTTTTTGCAGTl'CTTATGGAA 
TTCCAAAAAACAAAAAACAAACCAACCAACCAAAAACCTCTGAAACTAGAACTACC 
AATCCATTACTGGGTATGTAACAAAGAGAAATCTGCACAGAATTTATTGCTACATTG 
TTCATTATTCACGACAGCCAAGAATGTGGAACCAACTTACGTAGCCGTCAAAATATG 
AACGGATAAAGAAAATGTGGAAATGTGTACAACAGAGTCCCATGTGGCCATAAAAG 
AGTGAAATCATGACATATGCAGGAAATGGATGCAACTGGAAATCAATTGGGCTAAT 
CAAAACAAGACAGACTCAAAAAGGAAACACCGTGTAGCTTCTCTGACAAACAGAAG 
CTAGATTTACACTTGTACGTGCGCATGTGTGTTTAGAATTTTATTTAGTTATACACTA 
TTCTAATCTGTGAGTGTGTATAAAGGCATGCATGTAAAGCAAAAACAAGCTAGCTG 
GGGTGGGTAGGAGAGAAAGCAATGAGAGGAGTTAATAAGAACGAAGCATAGTAAC 
ATAGGTGCCAGGATGAAATGCATTAATTTGTATGCTAACTAAACCACAGACAGGAG 
GCACACGTTCAAACCAGGGTGAAATCCCAGCACAGAGAAGGGGAAGTAGACACAA 
AGTTTCGCCACTAACCAAGAAGCCATTTGCAGTTGCTGCCTGCTGGGAAGGGGCGTT 
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CCAGTTTTCTCCAGTCTGACACTGTGTATAACAACCAGTTGACAATACAAAGTTGGC 

AlGATGGAl'GGTrnTGTGCrAlTIllCATlTrnilCrrAC'rG'lTlTGTTGTTGTGGT 

GGTTGTTGTGGTGGTGGCTGTGGTTTTCATTTGTTTCim 

ATGAAATTGGGTGGGTAGGAAGCTGGAAACGATCTGGAAGAAGTTGGGGAAAGAG 

AAAAATTGTATGGAGCATATTTAAACAAACAAACAAACAAACAAAAGGTTCATTTT 

GCCACAAAAAGGTGTGAATTAAATTAACCAGTTACGACTCTTAAAGAAAATATTCCC 

AATTATTCCCAGAGTTGCTATGTATGCTGTGCCTAGGACTTTGCTTGAACTGGCCCTA 

TAACTCTGGTGTGGTGTCTTTTCAGGATGCAGAAGAGAGGCAGGGAAGTCAGCTGCT 

TGCTGATCTCCCTCACTGCCAl'CrGCCTGGTGGTCACCCCTGGGAGCAGGGTCTGTC 

ctcgccgatgtgcctgctatgtgcccacagaggtck:actgtacatttcgggacctga 

CCTCCATCCCAGACGGGCATCCCAGCCAATGTGK}AACGAGTCAATTTAGGGTGTGTG 
GACCTTGCCTGATCTCCTrcrCAG^ 

CCAAACACCTGTGATTACTTTTAATAGTTTTCTTCTAAAATGGGTTCATACA^ 

ATATTGTGGAGACAATGAACATTTTATCCCAATAGTCTTTTACTAGAACTTGAAGCC 

CCTCITAGTTGlTTGGGAGCCTCATAArrATGGGGCAGClTTATTCTGAATGAArrrr 

AAATGAAAAAGATACAGTTTCTGTTAACAATCATTATGATACCAAGGAAGAGGAAT 

TGTCATTGAATATlTrAAAAAAGCATTTCTTTTGCAATTTATAAAT^^ 

ATGGCTTACTTAAAATACTTGCCTTACTAAATCTGACAAATTATGGTGATATTTTGAA 

GGT1TATGAAAATTTGTTTATGTGTATAAATGCACAAGAAATGGGATATGCCATCAC 

CTATGTGCCATTAGTGAGCATGTACAGTATGCCAAACACTATTGTTCACGTTTGGAG 

GAAGTAATGGGGGTGGGGGAGCAACAAGGGTTATAACCGTATACCCAGTGCCTTGG 

AAGCGATTGCAAACAGTAAAGACTGACATTGTGTTCTCCCTATGAGGGAGGGGCCTT 

GGGCTGAGCACTTTGCAATGAGCATTTGCTCATTGTGCTGGCAGGTTTTATGATAAC 

TTGACCCAAGCTAGAGTCACTGGAGAGGAAGGAACTTCAACTGAGAACATGCCTGA 

AGAAGATCAGATTATAGGCAGGCCTGTGGGGCATTTTCTTAATTAGTGATTCATGGG 

GCAGGGCCCAGTCCATTGTTCGTGGTACCATTTCTCAGGCACTATTAAAAAAAAAAA 

AACAGGCTGAGCAAGTGTCAAGGAGCAAGTCAGTGAGCAGCAGCCCTAATGATCTC 

TGCATCAGCTCCTGCCTCCAGGTOCTACCCTAlT'rGAGTTCCTGTCCTAGCTCCCTA 

CAGTGATGAACAATGATGTGGAAGTATAAGCCAAATAAATCCTITCITCCCCAACTT 

GCTGTTGGTCATGATGTTTCATCACAGTGATAATAGTCCTCATGAAGATGCTGGTGT 

TTATAACACCTTTGGACTAAATTCTGTTATCTATAGCTGAGGAAAATGGAGCATAGA 

AAGTCTCCAGACTACACCAGAGTGTAATCTGGGCCTGAGCTTAGAATCACACCCAC 

GTGCACTCCACTGCCGGGGCTrCTTAACCGGAACACAGTTGTAAAAGGGAATriTCT 

GTTTGTTTCCATTTTGACATGTGGACTTTAATTGACGATTCATCTGAAGCT 

AiiriiiTTCCAGGTATAACAGCCTCACTAGATTGACAGAAAATGACTTTTCTGGCCT 

GAGCAGACTGGAGTTACTCATGCTGCACAGCAATGGCATTCACAGAGTCAGTGACA 

AGACCTTCTCGGGCTTGCAGTCCTTGCAGGTGAGATAGGTAGAGGGTGATGGAGGC 

TGAGAAGAGAGGTGCAACTGTGGGTTATACCCAAAAGCTGCTGATTCCCGTGGGAG 

ACATTCTATAAGCATTCTATAAACTAGAGGCAGATATCAAGGAAGGATTTCAATTGT 

AATGCAATTTTATGAGAAAATTTGAATATTAAGAAAATGCTGGGGAAAATGCTTAC 
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ACAATTGCGAGGACCTAATTTAGGATCTCCAATAGCCACATAAAAAGCACAGCATG 

GCGGCAGACACCTGCAATTCCTGTCCCTGGAAGCACCTGTTCAGAATCCCAGAGACT 

CATTGGCCAAACACTCTATTCAATCAATGAAGTCCATATTCAGTGACAAAACTTGAC 

TCAGAAACTAATGTGGAAAGCATCAGGAAGACAGCCAACATCTGGTCTCTACTCAT 

GCATGAATAAGGGATCCCAGAGAGAAGGGAAGAAAAAGGAAGGAAGGAAGGAAG 

GAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGAGAGGGAGGAAAGGAGGG 

AGGGAAGGAAGGAAAGGGAAAGGAAAAAAGAGATGGGGAGGGAGGGAAGGAAAG 

GAAAGGGGGAGAAAGAAGAGAAGAAAGGAAAATAAATAAATTTTCAGGGATTATT 

ACACCTTTAAATITrATCCATAAAAGGTCATTTCCACCTGTTTGTCTGGAA 

GGGATCCCTTATATAAGGGCAGTCTITAACATAGTAGCATTTTATAAACCA 

ATlTl^GAGlTlTCTCTACTfTTrATCCTCTACCATCrrCAAACTGAAACTACAArrATT 

cccacaaatgaagaaaatgctgtaagagtitrcacacaccgaagtgggaaacttaa 

ggattagacaagtctaacaatgagaatggggagaacaaaaagagactgcacaggg 

agccctttctctgcttataatcttgacacttgagaagctaattgacgctgcatgacta 

ctcaacrctttaagcaaacaatgctgttgttcatgaaaagcacaataaagtacatat 

gtcccataatattcatcaaaatttgcatgcagcacataatagcaatcaaagcaataa 

cacccactgttcacagagactttaaacatgaaacTggaactatgtctagtgttttga 

cttagggtacatagtatgctgtgtctgtatgtaccaatgttgatttaggtcatcaga 

cagcatttggaacatgtatcttcaggaggaatcattcatgtatcctgcatgaaattc 

tccacctatgtttattctcttagccaggtititctctgatggagaaacatrgggtttg 

aggttttactcccaggtaacatttagggaaaagctgtctatgttctcagtttg 

tatttatgagggatgttggtattccagaaaattctcttttgaagagattacaattta 

ggtcaaaacagaaaaatatgtaaaaagttattgtttttattagtatttcatgttctw 

aaataatcagagagggatitgatgaatgctaaagcatcatgaaaaattcaaaattt 

tttgcttctaattcagaatcaattaaattcatattactataaaagacagcacgccag 

atgtgtgccagctgaggagtggataaactgtgtaacgtgagtgctatgtagaaaca 

gaaaggagtgaagggttgatgtgcgctgcaacatcttgaaaacattcggctacatg 

atggaagccaggcacaaaaagccacatattgcatggttatgtttatatgaaatgttt 

aaaatacatggattcttagcaaacagagtaagatgttacttagggtcaggaaaaga 

ttaaaaaaaaaaaaactattgatgtggaatgatcttaatrtggggaaaagacaatt 

tcctaagacgaaatagttgaggtagatatagttatatccctgtggatattgtaataa 

accagcatgctgtgctctgagaagggcctaatgaaggggcaggaggaagtgaaatg 

agatggtagaaaggaaagtcatataccatggcttctctcgtgggtggaatctagat 

atgttaatatattgacataaaggaaggaattgtttagggaaggatcaaaaccaaca 

ggagtgagggagacaataggaaccaatgagaggcaaagttcatggtcaatgtgtgt 

ggagacaccataataaaactccttttttgtttgctaactaaaaccactaaaatctaa 

aaacaaaacatttttgcacaagaattatttattattcaataaagatgtttaaatggg 

ggaagttgaagttcattgatagtctcataaatcttaaatgtatttaaactgctrttta 

cgttttttattattaattactcttgctgtcattattatcatcatcattatcgtcatcat 
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CATCACTAATGCTTTTCACCATACACAAATGTAGGCAGAAGAGTGTAATCCACTTAG 

TGAGGCAATCTTGGAGAGGGAAAGGAAGCGGATGCGGGGCAGAGGCACACAGGAG 

GACAGTGAGAGGGAAATGAACAAGAAAAAATGTGGACACATGCACAAAAATTCCA 

TAGTCCACTACATTACTITGTATTCTAATATTAAGAAAATAATAAACCCATTTCTGTG 

CACTTATCACCCAGGCTCAACAGTTATOTGGCCACAGATCCTGTCTCACTGCATCCT 

GTCCACCTGAGTCCACTTAGCGTTCTGAATCCAATCCACKjGCATGATGCITACTOT 

ACACAGAACTAAAGATTAAAGAGAGTrTAAAAGTAACCATGACATCTCTCTGTTCCT 

TTAGCGATAAGTTOTAATATTTATGGCTGCTTGTGTATGTTCTAATTTCTCTAATATT 

GTCACATTTAGTTGGCAACTACTTTGTTTGAATTGAGTTGGAGTTAAGGTCCCATAG 

GATTAATCTCAACATATTTCTATATTTATAAACTlTrCTCTCTITG 

GAGAAAACAAATATGCCCATATCmOTTACAGGTCTTAAAAATGAGCTATAACAA 

AGTCCAAATAATTGAGAAGGATACTTTGTATGGACTCAGGAGCTTGACCCGGTTGCA 

CCTGGATCACAACAACATTGAGTTTATCAACCCCGAGGCGTTTTACGGACTCACCTT 

GCTCCGCTTGGTACATCTAGAAGGAAACCGGCTGACAAAGCTCCATCCAGACACAT 

TTGTCTCTTTGAGCTATCTCCAGATATTTAAAACCTCCTTCATTAAGNACCTGTACTT 

GTATGATAACTTCATTGACCTCCCTCCCAAAAGAAATGGTCTCCTCTATGCCAAACC 

TAGAAAGCCTTTACTTGCATGGAAACCCATGGACCTGTGACTGCCATTTAAAGTGGT 

TGTCCGAGTGGATGCAGGGAAACCCAGGTAACTATCTTGTTTGTTTGTTTCTTITm 

ATARKACGTATTTTCCTCAATTTCATTTAGAATGATATCCCAAAAGTCCCCCATAACC 

TCCCCCCCACTrCCCTACCTACCCATTCCCATTTTTTGGCCCTGGCATTCCCCTGTACT 

GGGGCATATAAAGTTTGCGTGTCCAATGGACCTCTCTTTCCAGTGATGGCCAACTAG 

GCCATCTTTTGATACATATGCAGCTAGAGTCAAGAGCTCTGGGGTACTGGTTAGTTC 
ATAATGTTGTTGCACCTACAGGGTTGAA (SEQ ID NO:20) 
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MPKRAHWGALSVVLE.LWGHPRVALACPHPCACYVPSEVHCTFRSLASVPAGIARHVE 

RINLGFNSIQAI^ETSFAGLTKLELLMfflGNEIPSIPDGALRDI^SLQWKFSYNKLRVITG 

QTLQGI^NLMRLHIDHNKIEFIHPQAFNGLTSLRT ,T ,HT .EGNLLHQLHPSTFSTFTFLDYFR 

I^TIRHLYLAENMVRTLPASMIitf^ 

KCKKDKAYEGGQLCAMCFSPKKLYKHE^^ 

EEDGGSQLILEKFQLPQWSISIJ>MTDEHGNMV^ 

AWAIX>ie:PMTRENYEKLWK^ 

yytgvraqilaepewvmqpsidiqlnrrqstakkvllsyytqysqtistkdtrqargrs 
wvmepsgavqpjdqtvleggpcqlscnvkase5psifwvlpdgsilkapmddpdskfsil 
ssgwlriksmepsdsglyqciaqvrdemdrmvyrvlvqspstqpaekdtvtigknpges 
wijpcnaiaipeahlswiu'nr^ 

nqqgadhftvgitvtkkgsglpskrgrrpgakai^rvp^divedeggsgmgdeentsr 

rllhpkdqevflktkddaingdkkakkgrrklk1.wkhsekepetnvaegrrvfesrrr 

inmankqinperwadn.akvrgki^kgtevpplikttsppsi^i^vtppfpavsppsasp 

vqt vts aees s advpllgeeehvlgtis s asmgi£hnhng velvepe vtstpleev vddl 

sekteeitstegdlkgtaaptlisepyepsptlhtldtvyekptheetategwsaadvgss 

peptsseyeppijmvslaesepmqyfdpdijehxsqp^ 

stsqmdstigepgvpgqshl(^ltdmhlvksslstqdtllikkgmkemsqtlqggnm 

ijegdpthsrssesegqf^ksrrijdstlgmssmspvkkpaettvgtlldkdtttvtttpr 

qkvapsstmsthpspvrpvpngpjrpjjrpnkfrhrhkqtpptifapset^ 

qvesslvptawvdntvntpkqlemeknaeftskgtprrkhgkrpnkhrytpstvssra 

sgskpspspenkhrnwtpssemiprtvslktegpydsi^ymtt^ 

wykptsdgkeiki>dvatnvdkhksdilvtgesitnaiptsrslvs™gef^esspvgfp 

gtptwnpsrtaqpgpxqtdipvttsgenltdppllkeledvdftseflssltvstpfhqee 

agssttlssikvevassqaetttldqdhi£ttvaillsetrpqnhtptaarmkepasssps 

mmslgqttttkpalpsprisqasrdskenvflnyvgnpeteatpvnnegtqhmsgpne 

i^tpssdrdai^stki£i£kqwgsrsijprgpdsqrqdgrvhashqltrvpakpe.pta 

tvripemstqsasryfvtsqsprhwtnkpfjttypsgalpenkqf^ 

SKPSffSKFTDRRTDQFNGYSKVFGNNNIPEAPvNPVGKPPSPRffHYSNG 

PQLGWRRPQIPTSPAPVMPJERKWGSYNPJHSHSTFHLDFGPPAPPIXHTPQTTGSPSTN 

LQNBPMVSSTQSSISFITSSVQSSGSFHQSSSKFFAGGPPASKFWSLGEKPQILTKSPQTVSV 

TAETDTVFPCEATGKPKPFVTWTKVSTGALMTPNTRIQRFEVLKNGTLVIRKVQVQDRG 

QYMCTASNLHGLDRMVVLL^VTVQQPQ1LASHYQDVTVYI.GDTIAMECLAKGTPAPQI 

SWIFPDRRVWQTVSPVESRITLHENRTLSIKEASFSDRGVYKCVASNAAGADSLAIRLHV 

AALPPVIHQEKLENISLPPGLSIHIHCTAKAAPLPSVR^ 

NGTLYIRNLAPKDSGRYECVAANLVGSARRTVQLNVQRAAANARITGTSPRRTDVRYG 
GTLKlI)CSASGDPWPRILWPXPSKRMmALFSFDSRIKWANGTLVVKSVTDK 
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DAGDYLCVAl^VGDDYVVLKVDVVMKPAKffiHKEENDHKVFYGGDLKVDCVATGL 

PNPEISWSIPDGSLVNSFMQSDDSGGRTKRYVVFNNGTLYIMEVGMREEGDYTCFAEN 

QVGKDEMRWVKVVTAPATIRNKTYLAVQWYGDWTVACEAKGEPMPKVTWLSFrN 

KVIPTSSEKYQIYQDGTLLIQKAQRSDSGNYTCLVRNSAGEDRKTVWIHVNVQPPKING 

NPNPITTVimiAAGGSRIXmCKAEGIFrPRVLWAFPEGVVLPAPY^ 

RSLRKSDSVQLVCMARl^GGEAJRLrVQLTVLEPMEKPIFTIDPIS 

AAGTPTPSLVWVLPNGTDLQSGQQI^RFYHKADGMLfflSGI^SVDAGAYllCVARNAA 
GHTERLVSIJCVGLKPEANKQYHNLV^ 

PQTLGRVSLIJ)NGTLTVREASVFDRGTYVCRMETEYGPSVTSIPVIVIAYPPRITSEPTPVI 
YTRPGNTVKLNCMAMGIPKADITWELPDKSHI.KAGVQARLYGNRFLHPQGSLTIQHAT 
QRDAGFYKCMAKNILGSDSKTTYIHVF (SEQ ID NO: 21) 
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Figure 52 

ATGCCCAAGC GCGCGCACTG GGGGGCCCTC TCCGTGGTGC TGATCCTGCT 
TTGGGGCCAT CCGCGAGTGG CGCTGGCCTG CCCGCATCCT TGTGCCTGCT 
ACGTCCCCAG CGAGGTCCAC TGCACGTTCC GATCCCTGGC TTCCGTGCCC 
GCTGGCATTG CTAGACACGT GGAAAGAATC AATTTGGGGT TTAATAGCAT 
ACAGGCCCTG TCAGAAACCT CATTTGCAGG ACTGACCAAG TTGGAGCTAC 
TTATGATTCA CGGCAATGAG ATCCCAAGCA TCCCCGATGG AGCTTTAAGA 
GACCTCAGCT CTCTTCAGGT TTTCAAGTTC AGCTACAACA AGCTGAGAGT 
GATCACAGGA CAGACCCTCC AGGGTCTCTC TAACTTAATG AGGCTGCACA 
TTGACCACAA CAAGATCGAG TTTATCCACC CTCAAGCTTT CAACGGCTTA 
ACGTCTCTGA GGCTACTCCA TTTGGAAGGA AATCTCCTCC ACCAGCTGCA 
CCCCAGCACC TTCTCCACGT TCACATTTTT GGATTAJTTC AGACTCTCCA 
CCATAAGGCA CCTCTACTTA GCAGAGAACA TGGTTAGAAC TCTTCCTGCC 
AGCATGCTTC GGAACATGCC GCTTCTGGAG AATCTTTACT TGCAGGGAAA 
TCCGTGGACC TGCGATTGTG AGATGAGATG GTTTTTGGAA TGGGATGCAA 
AATCCAGAGG AATTCTGAAG TGTAAAAAGG ACAAAGCTTA TGAAGGCGGT 
CAGTTGTGTG CAATGTGCTT CAGTCCAAAG AAGTTGTACA AACATGAGAT 
ACACAAGCTG AAGGACATGA CTTGTCTGAA GCCTTCAATA GAGTCCCCTC 
TGAGACAGAA CAGGAGCAGG AGTATTGAGG AGGAGCAAGA ACAGGAAGAG 
GATGGTGGCA GCCAGCTCAT CCTGGAGAAA TTCCAACTGC CCCAGTGGAG 
CATCTCTTTG AATATGACCG ACGAGCACGG GAACATGGTG AACTTGGTCT 
GTGACATCAA GAAACCAATG GATGTGTACA AGATTCACTT GAACCAAACG 
GATCCTCCAG ATATTGACAT AAATGCAACA GTTGCCTTGG ACTTTGAGTG 
TCCAATGACC CGAGAAAACT ATGAAAAGCT ATGGAAATTG ATAGCATACT 
ACAGTGAAGT TCCCGTGAAG CTACACAGAG AGCTCATGCT CAGCAAAGAC 
CCCAGAGTCA GCTACCAGTA CAGGCAGGAT GCTGATGAGG AAGCTCTTTA 
CTACACAGGT GTGAGAGCCC AGATTCTTGC AGAACCAGAA TGGGTCATGC 
AGCCATCCAT AGATATCCAG CTGAACCGAC GTCAGAGTAC GGCCAAGA AG 
GTGCTACTTT CCTACTACAC CCAGTATTCT CAAACAATAT CCACCAAAGA 
TACAAGGCAG GCTCGGGGCA GAAGCTGGGT AATGATTGAG CCTAGTGGAG 
CTGTGCAAAG AGATCAGACT GTCCTGGAAG GGGGTCCATG CCAGTTGAGC 
TGCAACGTGA AAGCTTCTGA GAGTCCATCT ATCTTCTGGG TGCTTCCAGA 
TGGCTCCATC CTGAAAGCGC CCATGGATGA CCCAGACAGC AAGTTCTCCA. 
TTCTCAGCAG TGGCTGGCTG AGGATCAAGT CCATGGAGCC ATCTGACTCA 
GGCTTGTACC AGTGCATTGC TCAAGTGAGG GATGAAATGG ACCGCATGGT 
ATATAGGGTA CTTGTGCAGT CTCCCTCCAC TCAGCCAGCC GAGAAAGACA 
CAGTGACAAT TGGCAAGAAC CCAGGGGAGT CGGTGACATT GCCTTGCAAT 
GCTTTAGCAA TACCCGAAGC CCACCTTAGC TGGATTCTTC CAAACAGAAG 
GATAATTAAT GATTTGGCTA ACACATCACA TGTATACATG TTGCCAAATG 
GAACTCTTTC CATCCCAAAG GTCCAAGTCA GTGATAGTGG 
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TTACTACAGA TGTGTGGCTG TCAACCAGCA AGGGGCAGAC CATTTTACGG 
TGGGAATCAC AGTGACCAAG AAAGGGTCTG GCTTGCCATC CAAAAGAGGC 
AGACGCCCAG GTGCAAAGGC TCTTTCCAGA GTCAGAGAAG ACATCGTGGA 
GGATGAAGGG GGCTCGGGCA TGGGAGATGA AGAGAACACT TCAAGGAGAC 
TTCTGCATCC AAAGGACCAA GAGGTGTTCC TCAAAACAAA GGATGATGCC 
ATCAATGGAG ACAAGAAAGC CAAGAAAGGG AGAAGAAAGC TGAAACTCTG 
GAAGCATTCG GAAAAAGAAC CAGAGACCAA TGTTGCAGAA GGTCGCAGAG 
TGTTTGAATC TAGACGAAGG ATAAACATGG CAAACAAACA GATTAATCCG 
GAGCGCTGGG CTGATATTTT AGCCAAAGTC CGTGGGAAAA ATCTCCCTAA 
GGGCACAGAA GTACCCCCGT TGATTAAAAC CACAAGTCCT CCATCCTTGA 
GCCTAGAAGT CACACCACCT TTTCCTGCTG TTTCTCCCCC CTCAGCATCT 
CCTGTGCAGA CAGTAACCAG TGCTGAAGAA TCCTCAGCAG ATGTACCTCT 
ACTTGGTGAA GAAGAGC ACG TTTTGGGTAC CATTTCCTC A GCCAGCATGG 
GGCTAGAACA CAACCACAAT GGAGTTATTC TTGTTGAACC TGAAGTAACA 
. AGCACACCTC TGGAGGAAGT TGTTGATGAC CTTTCTGAGA AGACTGAGGA 
GATAACTTCC ACTGAAGGAG ACCTGAAGGG GACAGCAGCC CCTACACTTA 
TATCTGAGCC TTATGAACCA TCTCCTACTC TGCACACATT AGACACAGTC 
TATGAAAAGC CCACCCATGA AGAGACGGCA ACAGAGGGTT GGTCTGCAGC 
AGATGTTGGA TCGTCACCAG AGCCCACATC CAGTGAGTAT GAGCCTCCAT 
TGGATGCTGT CTCCTTGGCT GAGTCTGAGC CCATGCAATA CTTTGACCCA 
GATTTGGAGA CTAAGTCACA ACCAGATGAG GATAAGATGA AAGAAGACAC 
CTTTGCACAC CTTACTCCAA CCCCCACCAT CTGGGTTAAT GACTCCAGTA 
CATCACAGTT ATTTGAGGAT TCTACTATAG GGGAACCAGG TGTCCCAGGC 
CAATCACATC TACAAGGACT GACAGACAAC ATCCACCTTG TGAAAAGTAG 
TCTAAGCACT CAAGACACCT TACTGATTAA AAAGGGTATG AAAGAGATGT 
CTCAGACACT ACAGGGAGGA AATATGCTAG AGGGAGACCC CACACACTCC 
AGAAGTTCTG AGAGTGAGGG CCAAGAGAGC AAATCCATCA CTTTGCCTGA 
CTCCACACTG GGTATAATGA GCAGTATGTC TCCAGTTAAG AAGCCTGCGG 
AAACCAC AGT TGGTACCCTC CTAGACAAAG ACACCACAAC AGTAACAACA 
ACACCAAGGC AAAAAGTTGC TCCGTCATCC ACCATGAGCA CTCACCCTTC 
TCGAAGGAGA CCCAACGGGA GAAGGAGATT ACGCCCCAAC AAATTCCGCC 
ACCGGCACA A GCAAACCCCA CCCACAACTT TTGCCCCATC AGAGACTTTT 
TCTACTCAAC CAACTCAAGC ACCTGACATT AAGATTTCAA GTCAAGTGGA 
GAGTTCTCTG GTTCCTACAG CTTGGGTGGA TAACACAGTT AATACCCCCA . 
AACAGTTGGA AATGGAGAAG AATGCAGAAC CCACATCCAA GGGAACACCA 
CGGAGAAAAC ACGGGAAGAG GCCAAACAAA CATCGATATA CCCCTTCTAC 
AGTGAGCTCA AGAGCGTCCG GATCCAAGCC CAGCCCTTCT CCAGAAAATA 
AACATAGAAA CATTGTTACT CCCAGTTCAG AAACTATACT TTTGCCTAGA 
ACTGTTTCTC TGAAAACTGA GGGCCCTTAT GATTCCTTAG ATTACATGAC 
AACCACCAGA AAAATATATT CATCTTACCC TAAAGTCCAA GAGACACTTC 
CAGTCACATA 
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TAAACCCACA TCAGATGGAA AAGAAATTAA GGATGATGTT GCCACAAATG 
TTGACAAACA TAAAAGTGAC ATTTTAGTCA CTGGTGAATC AATTACTAAT 
GCCATACCAA CTTCTCGCTC CTTGGTCTCC ACTATGGGAG AATTTAAGGA 
AGAATCCTCT CCTGTAGGCT TTCCAGGAAC TCCAACCTGG AATCCCTCAA 
GGACGGCCCA GCCTGGGAGG CTACAGACAG ACATACCTGT TACCACTTCT 
GGGGAAAATC TTACAGACCC TCCCCTTCTT AAAGAGCTTG AGGATGTGGA 
TTTCACTTCC GAGTTTTTGT CCTCTTTGAC AGTCTCCACA CCATTTCACC 
AGGAAGAAGC TGGTTCTTCC ACAACTCTCT CAAGCATAAA AGTGGAGGTG 
GCTTCAAGTC AGGCAGAAAC CACCACCCTT GATCAAGATC ATCTTGAAAC 
CACTGTGGCT ATTCTCCTTT CTGAAACTAG ACCACAGAAT CACACCCCTA 
CTGCTGCCCG GATGAAGGAG CCAGCATCCT CGTCCCCATC CACAATTCTC 
ATGTCTTTGG GACAAACCAC CACCACTAAG CCAGCACTTC CCAGTCCAAG 
AATATCTCAA GCATCTAGAG ATTCCAAGGA AAATGTTTTC TTGAATTATG 
TGGGGAATCC AGAAACAGAA GCAACCCCAG TCAACAATGA AGGAACACAG 
CATATGTCAG GGCCAAATGA ATTATCAACA CCCTCTTCCG ACCGGGATGC 
ATTTAACTTG TCTACAAAGC TGGAATTGGA AAAGCAAGTA TTTGGTAGTA 
GGAGTCTACC ACGTGGCCCA GATAGCCAAC GCCAGGATGG AAGAGTTCAT 
GCTTCTCATC AACTAACCAG AGTCCCTGCC AAACCCATCC TACCAACAGC 
AACAGTGAGG CTACCTGAAA TGTCCACACA AAGCGCTTCC AGATACTTTG 
TAACTTCCCA GTCACCTCGT CACTGGACCA ACAAACCGGA AATAACTACA 
TATCCTTCTG GGGCTTTGCC AGAGAACAAA CAGTTTACAA CTCCAAGATT 
ATCAAGTACA ACAATTCCTC TCCCATTGCA CATGTCCAAA CCCAGCATTC 
CTAGTAAGTT TACTGACCGA AGAACTGACC AATTCAATGG TTACTCCAAA 
GTGTTTGGAA ATAACAACAT CCCTGAGGCA AGAAACCCAG TTGGAAAGCC 
TCCCAGTCCA AGAATTCCTC ATTATTCCAA TGGAAGACTC CCTTTCTTTA 
CCAACAAGAC TCTTTCTTTT CCACAGTTGG GAGTCACCCG GAGACCCCAG 
ATACCCACTT CTCCTGCCCC AGTAATGAGA GAGAGAAAAG TTATTCCAGG 
TTCCTACAAC AGGATACATT CCCATAGCAC CTTCCATCTG GACTTTGGCC 
CTCCGGCACC TCCGTTGTTG CACACTCCGC AGACCACGGG ATCACCCTCA 
ACTAACTTAC AGAATATCCC TATGGTCTCT TCCACCCAGA GTTCTATCTC 
CTTTATAACA TCTTCTGTCC AGTCCTCAGG AAGCTTCCAC CAGAGCAGCT 
CAAAGTTCTT TGCAGGAGGA CCTCCTGCAT CCAAATTCTG GTCTCTTGGG 
GAAAAGCCCC AAATCCTCAC CAAGTCCCCA CAGACTGTGT CCGTCACCGC 
TGAGACAGAC ACTGTGTTCC CCTGTGAGGC AACAGGAAAA CCAAAGCCTT 
TCGTTACTTG GACAAAGGTT TCCACAGGAG CTCTTATGAC TCCGAATACC 
AGGATACAAC GGTTTGAGGT TCTCAAGAAC GGTACCTTAG TGATACGGAA 
GGTTCAAGTA CAAGATCGAG GCCAGTATAT GTGCACCGCC AGCAACCTGC 
ACGGCCTGGA CAGGATGGTG GTCTTGCTTT CGGTCACCGT GCAGCAACCT 
CAAATCCTAG CCTCCCACTA CCAGGACGTC ACTGTCTACC TGGGAGACAC 
CATTGCAATG GAGTGTCTGG CCAAAGGGAC CCCAGCCCCC CAAATTTCCT 
GGATCTTCCC TGACAGGAGG GTGTGGCAAA CTGTGTCCCC 
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CGTGGAGAGC CGCATCACCC TGCACGAAAA CCGGACCCTT TCCATCAAGG 
AGGCGTCCTT CTCAGACAGA GGCGTCTATA AGTGCGTGGC CAGCAATGCA 
GCCGGGGCGG ACAGCCTGGC CATCCGCCTG CACGTGGCGG CACTGCCCCC 
CGTTATCCAC CAGGAGAAGC TGGAGAACAT CTCGCTGCCC CCGGGGCTCA 
GCATTCACAT TCACTGCACT GCCAAGGCTG CGCCCCTGCC CAGCGTGCGC 
TGGGTGCTCG GGGACGGTAC CCAGATCCGC CCCTCGCAGT TCCTCCACGG 
GAACTTGTTT GTTTTCCCCA ACGGGACGCT CTACATCCGC AACCTCGCGC 
CCAAGGACAG CGGGCGCTAT GAGTGCGTGG CCGCCAACCT GGTAGGCTCC 
GCGCGCAGGA CGGTGCAGCT GAACGTGCAG CGTGCAGCAG CCAACGCGCG 
CATCACGGGC ACCTCCCCGC GGAGGACGGA CGTCAGGTAC GGAGGAACCC 
TCAAGCTGGA CTGCAGCGCC TCGGGGGACC CCTGGCCGCG CATCCTCTGG 
AGGCTGCCGT CCAAGAGGAT GATCGACGCG CTCTTCAGTT TTGATAGCAG 
AATCAAGGTG TTTGCCAATG GGACCCTGGT GGTGAAATCA GTGACGGACA 
AAGATGCCGG AGATTACCTG TGCGTAGCTC GAAATAAGGT TGGTGATGAC 
TACGTGGTGC TCAAAGTGGA TGTGGTGATG AAACCGGCCA AGATTGAACA 
CAAGGAGGAG AACGACCACA AAGTCTTCTA CGGGGGTGAC CTGAAAGTGG 
ACTGTGTGGC CACCGGGCTT CCCAATCCCG AGATCTCCTG GAGCCTCCCA 
GACGGGAGTC TGGTGAACTC CTTCATGCAG TCGGATGACA GCGGTGGACG 
CACCAAGCGC TATGTCGTCT TCAACAATGG GACACTCTAC TTTAACGAAG 
TGGGGATGAG GGAGGAAGGA GACTACACCT GCTTTGCTGA AAATCAGGTC 
GGGAAGGACG AGATGAGAGT CAGAGTCAAG GTGGTGACAG CGCCCGCCAC 
CATCCGGAAC AAGACTTACT TGGCGGTTCA GGTGCCCTAT GGAGACGTGG 
TCACTGTAGC CTGTGAGGCC AAAGGAGAAC CCATGCCCAA GGTGACTTGG 
TTGTCCCCAA CCAACAAGGT GATCCCCACC TCCTCTGAGA AGTATC AGAT 
ATACCAAGAT GGCACTCTCC TTATTCAGAA AGCCCAGCGT TCTGACAGCG 
GCAACTACAC CTGCCTGGTC AGGAACAGCG CGGGAGAGGA TAGGAAGACG 
GTGTGGATTC ACGTCAACGT CCAGCCACCC AAGATCAACG GTAACCCCAA 
CCCCATCACC ACCGTGCGGG AGATAGCAGC CGGGGGCAGT CGGAAACTGA 
TTGACTGCAA AGCTGAAGGC ATCCCCACCC CGAGGGTGTT ATGGGCTTTT 
CCCGAGGGTG TGGTTCTGCC AGCTCCATAC TATGGAAACC GGATCACTGT 
CCATGGCAAC GGTTCCCTGG ACATCAGGAG TTTGAGGAAG AGCGACTCCG 
TCCAGCTGGT ATGCATGGCA CGCAACGAGG GAGGGGAGGC GAGGTTGATC 
GTGCAGCTCA CTGTCCTGGA GCCCATGGAG AAACCCATCT TCCACGACCC 
GATCAGCGAG AAGATCACGG CCATGGCGGG CCACACCATC AGCCTCAACT 
GCTCTGCCGC GGGGACCCCG ACACCCAGCC TGGTGTGGGT CCTTCCCAAT 
GGCACCGATC TGCAGAGTGG ACAGCAGCTG CAGCGCTTCT ACCACAAGGC 
TGACGGCATG CTACACATTA GCGGTCTCTC CTCGGTGGAC GCTGGGGCCT 
ACCGCTGCGT GGCCCGCAAT GCCGCTGGCC ACACGGAGAG GCTGGTCTCC 
CTGAAGGTGG GACTGAAGCC AGAAGCAAAC AAGCAGTATC ATAACCTGGT 
CAGCATCATC AATGGTGAGA CCCTGAAGCT CCCCTGCACC CCTCCCGGGG 
CTGGGCAGGG ACGTTTCTCC TGGACGCTCC CCAATGGCAT GCATCTGGAG 
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GGCCCCCAAA CCCTGGGACG CGTTTCTCTT CTGGACAATG GCACCCTCAC 
GGTTCGTGAG GCCTCGGTGT TTGACAGGGG TACCTATGTA TGCAGGATGG 
AGACGGAGTA CGGCCCTTCG GTCACCAGCA TCCCCGTGAT TGTGATCGCC 
TATCCTCCCC GGATCACCAG CGAGCCCACC CCGGTCATCT ACACCCGGCC 
CGGGAACACC GTGAAACTGA ACTGCATGGC TATGGGGATT CCCAAAGCTG 
ACATCACGTG GGAGTTACCG GATAAGTCGC ATCTGAAGGC AGGGGTTCAG 
GCTCGTCTGT ATGGAAACAG ATTTCTTCAC CCCCAGGGAT CACTGACCAT 
CCAGCATGCC ACACAGAGAG ATGCCGGCTT CTACAAGTGC ATGGCAAAAA 
ACATTCTCGG CAGTGACTCC A AAACAACTT ACATCCACGT CTTCTGAAAT 
GTGGATTCCA GAATGATTGC TTAGGAACTG ACAACAAAGC GGGGTTTGTA 
AGGGAAGCCA GGTTGGGGAA TAGGAGCTCT TAAATAATGT GTCACAGTGC 
ATGGTGGCCT CTGGTGGGTT TCAAGTTGAG GTTGATCTTG ATCTACAATT 
GTTGGGA AAA GGAAGCAATG CAGACACGAG A AGGAGGGCT CAGCCTTGCT 
GAGACACTTT CTTTTGTGTT TACATCATGC CAGGGGCTTC ATTCAGGGTG 
TCTGTGCTCT GACTGCA ATT TTTCTTCTTT TGCAAATGCC ACTCGACTGC 
CTTCATAAGC GTCCATAGGA TATCTGAGGA ACATTCATCA A AAATAAGCC 
ATAGACATGA ACAACACCTC ACTACCCCAT TGAAGACGCA TCACCTAGTT 
AACCTGCTGC AGTTTTTACA TGATAGACTT TGTTCCAGAT TGAC AAGTC A 
TCTTTCAGTT ATTTCCTCTG TCACTTCAAA ACTCCAGCTT GCCCAATAAG 
GATTTAGAAC CAGAGTGACT GATATATATA TATATATTTT AATTCAGAGT 
TACATACATA CAGCTACCAT TTTATATGAA AAAAGAAAAA CATTTCTTCC 
TGGAACTCAC TTTTTATATA ATGTTTTATA TATATATTTT TTCCTTTCAA 
ATCAGACGAT GAGACTAGAA GGAGAAATAC TTTCTGTCTT ATTAAAATTA 
ATAAATTATT GGTCTTTACA AGACTTGGAT ACATTACAGC AGACATGGAA 
ATATAATTTT AAAAAATTTC TCTCCAACCT CCTTCAAATT CAGTCACCAC 
TGTTATATTA CCTTCTCCAG GAACCCTCCA GTGGGGAAGG CTGCGATATT 
AGATTTCCTT GTATGCAAAG TTTTTGTTGA AAGCTGTGCT CAGAGGAGGT 
GAGAGGAGAG GAAGGAGAAA ACTGCATCAT AACTTTACAG AATTGAATCT 
AGAGTCTTCC CCGAAAAGCC CAGAAACTTC TCTGCAGTAT CTGGCTTGTC 
CATCTGGTCT AAGGTGGCTG CTTCTTCCCC AGCCATGAGT CAGTTTGTGC 
CCATGAATAA TACACGACCT GTTATTTCCA TGACTGCTTT ACTGTATTTT 
TAAGGTCAAT ATACTGTACA TTTGATAATA A AATAATATT CTCCCAA AAA AAAAA 
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ATGAAGGTAAAAGGCAGAGGAATCACCTGCTTGCTGGTCTCCTTTGCTGTGATCTGC 

CTGGTCGCCACCCCTGGGGGCAAGGCCTGTCCTCGCCGCTGTGCCTGTTATATGCCT 

ACGGAGGTACACTGCACATTTCGGTACCTGACTTCCATCCCAGACAGCATCCCGCCC 

7\ATGTGGAACGCATCAATTTAGGATACAACAGCTTGGTTAGATTGATGGAAACAGAT 

TTTTCTGGCCTGACCAAACTGGAGTTACTCATGCTTCACAGCAATGGCATTCACACA 

ATCCCTGACAAGACCTTCTCAGATTTGCAGGCCTTGCAGGTCTTAAAAATGAGCTAT 

AATAAAGTCCGAAAACTTCAGAAAGATACTTTTTATGGCCTCAGGAGCTTGACACGA 

TTGCACATGGACCACAACAATATTGAGTTTATAAACCCAGAGGTTTTTTATGGGCTC 

AACTTTCTCCGCCTGGTGCACTTGGAAGGAAATCAGCTCACTAAGCTCCACCCAGAT 

ACATTTGTCTCTTTGAGCTACCTCCAGATATTTAAAATCTCTTTCATTAAGTTCCTA 

TACTTGTCTGATAACTTCCTGACCTCCCTCCCTCAAGAGATGGTCTCCTATATGCCT 

GACCTAGACAGCCTTTACCTGCATGGAAACCCATGGACCTGTGATTGCCATTTAAAG 

TGGTTGTCTGACTGGATACAGGAGAAGCCAGATGTAATAAAATGCAAAAAAGATAGA 

AGTCCCTCTAGTGCTCAGCAGTGTCCACTTTGCATGAACCCTAGGACTTCTAAAGGC 

AAGCCGTTAGCTATGGTCTCAGCTGCAGCTTTCCAGTGTGCCAAGCCAACCATTGAC 

TCATCCCTGAAATCAAAGAGCCTGACTATTCTGGAAGACAGTAGTTCTGCTTTCATC 

TCTCCCCAAGGTTTCATGGCACCCTTTGGCTCCCTCACTTTGAATATGACAGATCAG 

TCTGGAAATGAAGCTAACATGGTCTGCAGTATTCAAAAGCCCTCAAGGACATCACCC 

ATTGCATTCACTGAAGAAAATGACTACATCGTGCTAAATACTTCATTTTCAACATTT 

TTGGTGTGCAACATAGATTACGGTCACATTCAGCCAGTGTGGCAAATTTTGGCTTTG 

TACAGTGATTCTCCTCTGATACTAGAAAGGAGCCACTTGCTTAGTGAAACACCGCAG 

CTCTATTACAAATATAAACAGGTGGCTCCTAAGCCTGAAGACATTTTTACCAACATA 

GAGGCAGATCTCAGAGCAGATCCCTCTTGGTTAATGCAAGACCAAATTTCCTTGCAG 

CTGAACAGAACTGCCACCACATTCAGTACATTACAGATCCAGTACTCCAGTGATGCT 

CAT^ATCACTTTACCAAGAGCAGAGATGAGGCCAGTGAAACACAAATGGACTATGATT 

TCAAGGGATAACAATACTAAGCTGGAACATACTGTCTTGGTAGGTGGAACCGTTGGC 

CTGAACTGCCCAGGCCAAGGAGACCCCACCCCACACGTGGATTGGCTTCTAGCTGAT 

GGAAGTAAAGTGAGAGCCCCTTATGTCAGTGAGGATGGACGGATCCTAATAGACAAA 

AGTGGAAAATTGGAACTCCAGATGGCTGATAGTTTTGACACAGGCGTATATCACTGT 

ATAAGCAGCAATTATGATGATGCAGATATTCTCACCTATAGGATAACTGTGGTAGAA 

CCTTTGGTCGAAGCCTATCAGGAAAATGGGATTCATCACACAGTTTTCATTGGTGAA 

ACACTTGATCTTCCATGCCATTCTACTGGTATCCCAGATGCCTCTATTAGCTGGGTT 

ATTCCAGGAAACAATGTGCTCTATCAGTCATCAAGAGACAAGAAAGTTCTAAACAAT 

GGCACATTAAGAATATTACAGGTCACCCCGAAAGACCAAGGTTATTATCGCTGTGTG 

GCAGCCAACCCATCAGGGGTTGATTTTTTGATTTTCCAAGTTTCAGTCAAGATGAAA 

GGACAAAGGCCCTTGGAGCATGATGGAGAAACAGAGGGATCTGGACTTGATGAGTCC 

7\ATCCTATTGCTCATCTTAAGGAGCCACCAGGTGCACAACTCCGTACATCTGCTCTG 

ATGGAGGCTGAGGTTGGAAAACACACCTCAAGCACAAGTAAGAGGCACAACTATCGG 

GAATTAACACTCCAGCGACGTGGAGATTCAACACATCGACGTTTTAGGGAGAATAGG 

AGGCATTTCCCTCCCTCTGCTAGGAGAATTGACCCACAACATTGGGCGGCACTGTTG 

GAGAAAGCTAAAAAGAATGCTATGCCAGACAAGCGAGA7\AATACCACAGTGAGCCCA 

CCCCCAGTGGTCACCCAACTCCCAAACATACCTGGTGAAGAAGACGATTCCTCAGGC 

ATGCTCGCTCTACATGAGGAATTTATGGTCCCGGCCACTAAAGCTTTGAACCTTCCA 

GCAAGGACAGTGACTGCTGACTCCAGAACAATATCTGATAGTCCTATGACAAACATA 

AATTATGGCACAGAATTCTCTCCTGTTGTGAATTCACAAATACTACCACCTGAAGAA 

CCCACAGATTTCAAACTGTCTACTGCTATTAAAACTACAGCCATGTCAAAGAATATA 
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AACCCAACCATGTCAAGCCAAATACAAGGCACAACCAATCAACATTCATCCACTGTC 
TTTCCACTGCTACTTGGAGCAACTGAATTTCAGGACTCTGACCAGATGGGAAGAGGA 
AGAGAGCATTTCCAAAGTAGACCCCCAATAACAGTAAGGACTATGATCAAAGATGTC 
AATGTCAA/^ATGCTTAGTAGCACCACCAACAAACTATTATTAGAGTCAGTAAATACC 
ACAAATAGTCATCAGACATCTGTAAGAGAAGTGAGTGAACCCAGGCACAATCACTTC 
TATTCTCACACTACTCAAATACTTAGCACCTCCACGTTCCCTTCAGATCCACACACA 
GCTGCTCATTCTCAGTTTCCGATCCCTAGAAATAGTACAGTTAACATCCCGCTGTTC 

agacgctttgggaggcagaggaaaattggcggaagggggcggattatcagcccatat 
agaactccagttctgcgacggcatagatacagcattttcaggtcaacaaccagaggt 
tcttctgaaaaaagcactactgcattctcagccacagtgctcaatgtgacatgtctg 
tcctgtcttcccagggagaggctcaccactgccacagcagcattgtcttttccziagt 
gctgctcccatcaccttccccaaagctgacattgctagagtcccatcagaagagtct 
acaactctagtccagaatccactattactacttgagaacaaacccagtgtagagaaa 
acaacacccacaataaaatatttcaggactgaaatttcccaagtgactccaactggt 
gcagtcatgacatatgctccaacatccatacccatggaaaaaactcacaaagtaaac 
gccagttacccacgtgtgtctagcaccaatgaagctaaaagagattcagtgattaca 
tcgtcactttcaggtgctatcaccaagccaccaatgactattatagccattacaagg 
ttttcaagaaggaaaattccctggcaacagaactttgtaaataaccataacccaaaa 
ggcagattaaggaatcaacataaagttagtttacaaaaaagcacagctgtgatgctt 
cctaaaacatctcctgctttaccacagagacAaagttcccctttccatttcaccaca 

CTTTCAACAAGTGTGATGCAAATTCCATCTAATACC*fTGACTACCGCTCACCACACT 
ACGACCAAAACACACAATCCTGGAAGTCTTCCAACAAAGAAGGAGCTTCCCTTCCCA 
CCCCTTAACCCTATGCTTCCTAGTATTATAAGCAAAGACTCAAGTACAAAAAGCATC 
ATATCAACGCAAACAGCAATACCAGCAACAACTCCTACCTTCCCTGCATCTGTCATC 
ACTTATGAAACCCAAACAGAGAGATCTAGAGCACAAACAATACAAAGAGAACAGGAG 
CCTCAAAAGAAGAACAGGACTGACCCAAACATCTCTCCAGACCAGAGTTCTGGCTTC 
ACTACACCCACTGCTATGACACCTCCTGCTCTGGCATTCACTCATTCCCCACCAGAA 
AACACAACTGGG f ATTTCAAGCACAATCAGTTTTCATTCAAGAACTCTTAATCTGACA 
GATGTGATTGAAGAACTAGCCCAAGCAAGTACTCAGACTTTGAAGAGCACAATTGCT 
TCTGAAACAACTTTGTCCAGCAAATCACACCAGAGTACCACAACTAGGAAAGCATCA 
TTAGACACTCCCATACCACCATTCTTGAGCAGCAGTGCTACTCTAATGCCAGTTCCC 
ATCTCCCCTCCCTTTACTCAGAGAGCAGTTACTGACACACGTGGCGACTCCCATTTC 
CGGCTTATGACAAATACAGTGGTCAAGCTGCACGAATCCTCAAGGCACAATCTCCAA 
ATGCCAAGTTCACAATTGGAACCACTCACTTCATCTACCTCTAATCTGTTACATTCT 
ACTCCCATGCCAGCACTAACAACAGTTAAATCACAGAATTCCAAATTAACTCCATCT 
CCCTGGGCAGAATACCAATTTTGGCACAAACCATACTCAGACATTGCTGAAAAAGGC 
AAAAAGCCAGAAGTAAGCATGTTGGCTACTACAGGCCTGTCCGAGGCCACCACTCTT 
GTTTCAGATTGGGATGGACAGAAGAACACAAAGAAGAGTGACTTTGATAAGAAACCA 
GTTCAAGAAGCAACAACTTCCAAACTCCTTCCCTTTGACTCTTTGTCTAGGTATATA 
TTTGAAAAGCCCAGGATAGTTGGAGGAAAAGCTGCAAGTTTTACTATTCCAGCTAAC 
TCAGATGCCTTTCTTCCCTGTGAAGCTGTTGGAAATCCCCTGCCCACCATTCATTGG 
ACCAGAGTTTCAGGACTTGATTTATCTAGAGGAAACCAGAATAGCAGGGTCCAGGTT 
CTCCCCAATGGTACCCTGTCCATCCAGAGGGTGGAAATTCAGGACCGCGGACAGTAC 
TTGTGTTCCGCATCCAATCTGTTTGGCACAGACCACCTTCATGTCACCTTGTCTGTG 
GTTTCCTATCCTCCCAGGATCCTGGAGAGACGTACCAAAGAGATCACKGTTCATTCC 
GGAAGCACTGTGGAACTGAAGTGCAGAGCAGAAGGTAGGCCAAGCCCTACAGTTACC 
TGGATTCTTGCAAACCAAACAGTTGTCTCAGAATCATCCCAGGGAAGTAGGCAGGCT 
GTGGTGACGGTTGACGGAACATTGGTCCTCCACAATCTCAGTATTTATGACCGTGGC 
TTTTACAAATGTGTGGCCAGCAACCCAGGTGGCCAGGATTCACTGCTGGTTAAAATA 
CAAGTCATTGCAGCACCACCTGTTATTCTAGAGCAAAGGAGGCAAGTCATTGTAGGC 
ACTTGGGGTGAAAGTTTAAAACTGCCCTGTACTGCAAAAGGAACTCCTCAGCCCAGC 
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GTTTACTGGGTCCTCTCTGATGGCACTGAAGTGAAACCATTACAGTTTAGCAATTGp" 

AAGTTGTTCTTATTTTCAAATGGGACTTTGTATATAAGAAACCTAGCCTCfT&AGAC 

AGGGGCACTTATGAATGCATTGCTACCAGTTCCACTGGTTCGGAGCGAAGAGTAGTA 

ATGCTTACAATGGAAGAGCGAGTGACCAGCCCCAGGATAGAAGCTGCATCCCAGAAA 

AGGACTGAAGTGAATTTTGGGGACAAATTACTACTGAACTGCTCAGCCACTGGGGAG 

CCCAAACCCCAAATAATGTGGAGGTTACCATCCAAGGCTGTGGTCGACCAGTGGAGC 

TGGATCCACGTCTACCCTAATGGATCCCTGTTTATTGGATCAGTAACAGAAAAAGAC 

AGTGGTGTCTACTTGTGTGTGGCAAGAAACAAAATGGGGGATGATCTGATACTGATG 

CATGTTAGCCTAAGACTGAAACCTGCCAAAATTGACCACAAGCAGTATTTTAGAAAG 

CAAGTGCTCCATGGGAAAGATTTCCAAGTAGATTGCAAAGCTTCCGGCTCCCCAGTG 

CCAGAGATATCTTGGAGTTTGCCTGATGGAACCATGATCAACAATGCAATGCAAGCC 

GATGACAGTGGCCACAGGACTAGGAGATATACCCTTTTCAACAATGGAACTTTATAC 

TTCAACAAAGTTGGGGTAGCGGAGGAAGGAGATTATACTTGCTATGCCCAGAACACC 

CTAGGGAAAGATGAAATGAAGGTCCACTTAACAGTTATAACAGCTGCTCCCCGGATA 

AGGCAGAGTAACAAAACCAACAAGAGAATCAAAGCTGGAGACACAGCTGTCCTTGAC 

TGTGAGGTCACTGGGGATCCCAAACCAAAAATATTTTGGTTGCTGCCTTCCAATGAC 

ATGATTTCCTTCTCCATTGATAGGTACACATTTCATGCCAATGGGTCTTTGACCATC 

AACAAAGTGAAACTGCTCGATTCTGGAGAGTACGTATGTGTAGCCCGAAATCCCAGT 

GGGGATGACACCAAAATGTACAAACTGGATGTGGTCTCTAAACCTCCATTAATCAAT 

GGTCTGTATACAAACAGAACTGTTATTAAAGCCAGAGCTGTGAGACATTCCAAAAAA 

CACTTTGACTGCAGAGCTGAAGGGACACCATCTCCTGAAGTCATGTGGATCATGCCA 

GACAATATTTTCCTCACAGCCCCATACTATGGAAGCAGAATCACAGTCCATAAAAAT 

GGAACCTTGGAAATTAGGAATGTGAGGCTTTCAGATTCAGCCGACTTTATCTGTGTG 

GCCCGAAATGAAGGTGGAGAGAGCGTGTTGGTAGTACAGTTAGAAGTACTGGAAATG 

CTGAGAAGACCGACATTTAGAAATCCATTTAATGAAAAAATAGTTGCCCAGCTGGGA 

AAGTCCACAGCATTGAATTGCTCTGTTGATGGTAACCCACCACCTGAAATAATCTGG 

ATTTTACCAAATGGCACACGATTTTCCAATGGACCACAAAGTTATCAGTATCTGATA 

GCAAGCAATGGTTCTTTTATCATTTCTAAAACAACTCGGGAGGATGCAGGAAAATAT 

CGCTGTGCAGCTAGGAATAAAGTTGGCTATATTGAGAAATTAGTCATATTAGAAATT 

GGCCAGAAGCCAGTTATTCTTACCTATGCACCAGGGACAGTAAAAGGCATCAGTGGA 

GAATCTCTATCACTGCATTGTGTGTCTGATGGAATCCCTAAGCCAAATATCAAATGG 

ACTATGCCAAGTGGTTATGTAGTAGACAGGCCTCAAATTAATGGGAAATACATATTG 

CATGACAATGGCACCTTAGTCATTAAAGAAGCAACAGCTTATGACAGAGGAAACTAT 

ATCTGTAAGGCTCAAAATAGTGTTGGTCATACACTGATTACTGTTCCAGTAATGATT 

GTAGCCTACCCTCCCCGAATTACAAATCGTCCACCCAGGAGTATTGTCACCAGGACA 

GGGGCAGCCTTTCAGCTCCACTGTGTGGCCTTGGGAGTTCCCAAGCCAGAAATCACA 

TGGGAGATGCCTGACCACTCCCTTCTCTCAACGGCAAGTAAAGAGAGGACACATGGA 

AGTGAGCAGCTTCACTTACAAGGTACCCTAGTCATTCAGAATCCCCAAACCTCCGAT 

TCTGGGATATACAAATGCACAGCAAAGAACCCACTTGGTAGTGATTATGCAGCAACG 
TATATTCAAGTAATCTGA 
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MKVKGRGITC LLVSFAVICL VATPGGKACP RRCACYMPTE VHCTFRYLTS 
IPDSIPPNVE RINLGYNSLV RLMETDFSGL TKLELLMLHS NGIHTIPDKT 
FSDLQALQVL KMSYNKVRKL QKDTFYGLRS LTRLHMDHNN IEFINPEVFY 
GLNFLRLVHL EGNQLTKLHP DTFVSLSYLQ IFKISFIKFL YLSDNFLTSL 
PQEMVSYMPD LDSLYLHGNP WTCDCHLKWL SDWIQEKPDV IKCKKDRSPS 
SAQQCPLCMN PRTSKGKPLA MVSAAAFQCA KPTIDSSLKS KSLTILEDSS 
SAFISPQGFM APFGSLTLNM TDQSGNEANM VCSIQKPSRT SPIAFTEEND 
YIVLNTSFST FLVCNIDYGH IQPVWQILAL YSDSPLILER SHLLSETPQL 
YYKYKQVAPK PEDIFTNIEA DLRADPSWLM QDQISLQLNR TATTFSTLQI 
QYSSDAQITL PRAEMRPVKH KWTMISRDNN TKLEHTVLVG GTVGLNCPGQ 
GDPTPHVDWL LADGSKVRAP YVSEDGRILI DKSGKLELQM ADSFDTGVYH 
CISSNYDDAD ILTYRITWE PLVEAYQENG IHHTVFIGET LDLPCHSTGI 
PDASISWVIP GNNVLYQSSR DKKVLNNGTL RILQVTPKDQ GYYRCVAANP 
SGVDFLIFQV SVKMKGQRPL EHDGETEGSG LDESNPIAHL KEPPGAQLRT 
SALMEAEVGK HTSSTSKRHN YRELTLQRRG DSTHRRFREN RRHFPPSARR 
IDPQHWAALL EKAKKNAMPD KRENTTVSPP PWTQLPNIP GEEDDSSGML 
ALHEEFMVPA TKALNLPART VTADSRTISD SPMTNINYGT EFSPWNSQI 
LPPEEPTDFK LSTAIKTTAM SKNINPTMSS QIQGTTNQHS STVFPLLLGA 
TEFQDSDQMG RGREHFQSRP PITVRTMIKD VNVKMLSSTT NKLLLESVNT 
TNSHQTSVRE VSEPRHNHFY SHTTQILSTS TFPSDPHTAA HSQFPIPRNS 
TVNIPLFRRF GRQRKIGGRG RIISPYRTPV LRRHRYSIFR STTRGSSEKS 
TTAFSATVLN VTCLSCLPRE RLTTATAALS FPSAAPITFP KADIARVPSE 
ESTTLVQNPL LLLENKPSVE KTTPTIKYFR TEISQVTPTG AVMTYAPTSI 
PMEKTHKVNA SYPRVSSTNE AKRDSVITSS LSGAITKPPM TIIAITRFSR 
RKIPWQQNFV NNHNPKGRLR NQHKVSLQKS TAVMLPKTSP ALPQRQSSPF 
HFTTLSTSVM QIPSNTLTTA HHTTTKTHNP GSLPTKKELP FPPLNPMLPS 
IISKDSSTKS IISTQTAIPA TTPTFPASVI TYETQTERSR AQTIQREQEP 
QKKNRTDPNI SPDQSSGFTT PTAMTPPALA FTHSPPENTT GISSTISFHS 
RTLNLTDVIE ELAQASTQTL KSTIASETTL SSKSHQSTTT RKASLDTPIP 
PFLSSSATLM PVPISPPFTQ RAVTDTRGDS HFRLMTNTVV KLHESSRHNL 
QMPSSQLEPL TSSTSNLLHS TPMPALTTVK SQNSKLTPSP WAEYQFWHKP 
YSDIAEKGKK PEVSMLATTG LSEATTLVSD WDGQKNTKKS DFDKKPVQEA 
TTSKLLPFDS LSRYIFEKPR IVGGKAASFT IPANSDAFLP CEAVGNPLPT 
IHWTRVSGLD LSRGNQNSRV QVLPNGTLSI QRVEIQDRGQ YLCSASNLFG 
TDHLHVTLSV VSYPPRILER RTKEITVHSG STVELKCRAE GRPSPTVTWI 
LANQTVVSES SQGSRQAVVT VDGTLVLHNL SIYDRGFYKC VASNPGGQDS 
LLVKIQVIAA PPVILEQRRQ VIVGTWGESL KLPCTAKGTP QPSVYWVLSD' 
GTEVKPLQFT NSKLFLFSNG TLYIRNLASS DRGTYECIAT SSTGSERRW 
MLTMEERVTS PRIEAASQKR TEVNFGDKLL LNCSATGEPK PQIMWRLPSK 
AVVDQWSWIH VYPNGSLFIG SVTEKDSGVY LCVARNKMGD DLILMHVSLR 
LKPAKIDHKQ YFRKQVLHGK DFQVDCKASG SPVPEISWSL PDGTMINNAM 
QADDSGHRTR RYTLFNNGTL YFNKVGVAEE GDYTCYAQNT LGKDEMKVHL 
TVITAAPRIR QSNKTNKRIK AGDTAVLDCE VTGDPKPKIF WLLPSNDMIS 
FSIDRYTFHA NGSLTINKVK LLDSGEYVCV ARNPSGDDTK MYKLDVVSKP 
PLINGLYTNR TVIKATAVRH SKKHFDCRAE GTPSPEVMWI MPDNI FLTAP 
YYGSRITVHK NGTLEIRNVR LSDSADFICV ARNEGGESVL VVQLEVLEML 
RRPTFRNPFN EKIVAQLGKS TALNCSVDGN PPPEIIWILP NGTRFSNGPQ 
SYQYLIASNG SFIISKTTRE DAGKYRCAAR NKVGYIEKLV ILEIGQKPVI 
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